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Series Preface 



Mathematics is playing an ever more important role in the physical and 
biological sciences, provoking a blurring of boundaries between scientific 
disciplines and a resurgence of interest in the modern as well as the classical 
techniques of applied mathematics. This renewal of interest, both in re- 
search and teaching, has led to the establishment of the series Texts in 
Applied Mathematics (TAM). 

The development of new courses is a natural consequence of a high level 
of excitement on the research frontier as newer techniques, such as numeri- 
cal and symbolic computer systems, dynamical systems, and chaos, mix 
with and reinforce the traditional methods of applied mathematics. Thus, 
the purpose of this textbook series is to meet the current and future needs 
of these advances and to encourage the teaching of new courses. 

TAM will publish textbooks suitable for use in advanced undergraduate 
and beginning graduate courses, and will complement the Applied Mathe- 
matical Sciences (AMS) series, which will focus on advanced textbooks and 
research- level monographs. 



Pasadena, California 
Providence, Rhode Island 
College Park, Maryland 



J.E. Marsden 
L. Sirovich 
S.S. Antman 




Preface to the Second Edition 



This edition contains a significant amount of new material. The main rea- 
son for this is that the subject of applied dynamical systems theory has 
seen explosive growth and expansion throughout the 1990s. Consequently, 
a student needs a much larger toolbox today in order to begin research on 
significant problems. 

I also try to emphasize a broader and more unified point of view. My 
goal is to treat dissipative and conservative dynamics, discrete and con- 
tinuous time systems, and local and global behavior, as much as possible, 
on the same footing. Many textbooks tend to treat most of these issues 
separately (e.g., dissipative, discrete time, local dynamics; global dynamics 
of continuous time conservative systems, etc.). However, in research one 
generally needs to have an understanding of each of these areas, and their 
inter-relations. For example, in studying a conservative continuous time 
system, one might study periodic orbits and their stability by passing to a 
Poincare map (discrete time). The question of how stability may be affected 
by dissipative perturbations may naturally arise. Passage to the Poincare 
map renders the study of periodic orbits a local problem (i.e., they are fixed 
points of the Poincare map), but their manifestation in the continuous time 
problem may have global implications. An ability to put together a “big 
picture” from many (seemingly) disparate pieces of information is crucial 
for the successful analysis of nonlinear dynamical systems. 

This edition has seen a major restructuring with respect to the first 
edition in terms of the organization of the chapters into smaller units with 
a single, common theme, and the exercises relevant to each chapter now 
being given at the end of the respective chapter. 

The bulk of the material in this book can be covered in three ten week 
terms. This is an ambitious program, and requires relegating some of the 
material to background reading (described below) . My goal was to have the 
necessary background material side-by-side with the material that I would 
lecture on. This tends to be more demanding on the student, but with the 
right guidance, it also tends to be more rewarding and lead to a deeper 
understanding and appreciation of the subject. 

The mathematical prerequisites for the course are really not great; ele- 
mentary analysis, multivariable calculus, and linear algebra are sufficient. 
In reality, this may not be enough on its own. A successful understanding 
of applied dynamical systems theory requires the students to have an inte- 
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grated knowledge of these prerequisites in the sense that they can fluidly 
manipulate and use the ideas between the subjects. This means they must 
possess the quality often referred to as “mathematical maturity.” A study 
of dynamical systems theory can be a good way to obtain this. In addi- 
tion, an ordinary differential equations course from the geometric point of 
view (e.g., the material in the books of Arnold [1973] or Hirsch and Smale 
[1974]) would be ideal. 

Chapters 1-17 form the core of the first term material. It provides stu- 
dents with the basic concepts and tools for the study of dynamical systems 
theory. I tend to cover chapters 7, 11 and 12 at a brisk pace. The main 
point there is the ideas and main results. The details can be grasped over 
time, and in other settings. Chapters 13-17 could be viewed as belonging to 
the common theme of “dynamical systems with special structure.” Chap- 
ter 14 is the most important of these chapters. The relation, and contrasts, 
between Hamiltonian and reversible systems is useful to understand, and 
is the reason for including chapter 16. I often just assign selected back- 
ground reading from chapter 13, but knowledge of the relation between 
Lagrangian and Hamiltonian dynamical systems is of growing importance 
in applications. Gradient dynamical systems arise in numerous applica- 
tions (e.g., in biologically related areas) and knowledge of the nature of 
their dynamics, and how it contrasts with, e.g., Hamiltonian dynamics, is 
important. Chapter 17 is short, but I have always felt that students should 
be aware of these results because there are numerous examples of systems 
arising in applications that experience a “transient temporal disturbance.” 
Throughout the early chapters I discuss a number of results and theoretical 
frameworks for general nonautonomous vector fields (i.e., time-dependent 
vector fields whose time dependence is not periodic). This area tradition- 
ally has not been a part of dynamical systems from a geometric point of 
view, but this situation is changing rapidly, and I believe it will play an 
increasingly important role in applications in the near future. 

Chapters 18-22 are covered in the second term. The subject is “local 
bifurcation theory.” The two key tools for the local analysis of dynami- 
cal systems are center manifold theory and normal form theory, covered in 
chapters 18 and 19. The chapter on normal form theory is greatly expanded 
from the first edition. The main new material is the normal form work of 
Elphick, Tirapegui, Bracket, Coullet and looss, a discussion of Hamilto- 
nian normal form theory (following Churchill, Kummer, and Rod), and 
some material on symmetries (whose possible existence, and implications, 
should be considered in the course of study of any dynamical system) . Pos- 
sibly sections 19.1-19.3 could have been omitted in this edition; however 
it has been my experience that students understand the later (and more 
difficult) material more easily once they have been exposed to this more 
pedestrian introduction. In chapters 20 and 21 I tend not to cover in much 
detail the material related to the codimension of a bifurcation and versal 
deformations. This is a standard language used in discussing the subject 
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and it is important that the students have all the details available to them 
for background reading and see it in the context of the material I lecture 
on. New material on Hamiltonian bifurcations and circle maps is included. 
The inclusion of introductory material on Hamiltonian bifurcations is an 
example of the effort to have a broader and more unified point of view as 
discussed earlier. For example, we first describe the “generic saddle-node 
bifurcation at a single zero eigenvalue.” It is then natural to ask about the 
saddle-node bifurcation in a Hamiltonian system, which turns out to be 
rather different. Chapter 22 mainly serves as a warning that the way in 
which bifurcation phenomena are discussed in applications may not agree 
with the mathematical reality, and appropriate pointers to the literature 
are given. 

Chapters 23-33 are covered in the third term. The subject is “global 
dynamics, bifurcations, and chaos.” There is a sprinkling of new material 
throughout these chapters (e.g., a proof of a simple version of the lambda 
lemma and a proof of the shadowing lemma), but the structure is basically 
the same as the first edition. 

There is not a great deal of overlap between the material in the individ- 
ual terms, and with the appropriate prerequisites, each of these one term 
courses could be viewed as an independent course in itself. The textbook 
provides the necessary background for the students to make this a possi- 
bility. 

Some material has been left out of this edition; in particular, material 
on averaging, the subharmonic Melnikov function, and lobe dynamics. The 
reason is that over time I have begun to cover averaging and the subhar- 
monic Melnikov function as topics in a course solely devoted to perturbation 
methods. I cover lobe dynamics in a course devoted to transport phenom- 
ena in dynamical systems, which has developed in the last ten years to the 
point that it now justifies an independent course of its own, with applica- 
tions taken from many diverse disciplines. 

It has been my experience over time that a significant obstacle for stu- 
dents in their study of the subject is the sheer amount of (initially) unfa- 
miliar jargon. In order to make this a bit easier to deal with I have now 
included a glossary of frequently used terms. The bibliography has also 
been updated and greatly expanded. 

I would also like to take this opportunity to express my gratitude to 
the National Science Foundation and to Dr. Wen Masters and Dr. Reza 
Malek-Madani of the Office of Naval Research for their generous support 
of my research over the years. Research and teaching are two sides of the 
same coin, and it is only through an active and fruiful research program 
that the teaching becomes alive and relevant. 



Bristol, England 

2003 



Stephen Wiggins 
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Introduction 



In this book we will study equations of the following form 



X = f{x,t] fi), 


(0.0.1) 


X H> g{x;p,), 


(0.0.2) 



with X G U C K”, t G and fj, € V C where U and V are open 
sets in R" and R^, respectively. The overdot in (0.0.1) means “Jj,” and 
we view the variables as parameters. In the study of dynamical systems 
the independent variable is often referred to as “time.” We will use this 
terminology from time to time also. We refer to (0.0.1) as a vector field or 
ordinary differential equation and to (0.0.2) as a map or difference equa- 
tion. Both will be termed dynamical systems. Before discussing what we 
might want to know about (0.0.1) and (0.0.2), we need to establish a bit of 
terminology. 

By a solution of (0.0.1) we mean a map, a;, from some interval / C R^ 
into R", which we represent as follows 

x:/^R”, 

t i-G x{t), 



such that x{f) satisfies (0.1), i.e., 

x{f) = f{x{t),t;fj). 

The map x has the geometrical interpretation of a curve in R", and (0.0.1) 
gives the tangent vector at each point of the curve, hence the reason for 
referring to (0.0.1) as a vector field. We will refer to the space of dependent 
variables of (0.0.1) (i.e., R”) as the phase space of (0.0.1), and, abstractly, 
our goal will be to understand the geometry of solution curves in phase 
space. We remark that in many applications the structure of the phase 
space may be more general than R”; frequent examples are cylindrical, 
spherical, or toroidal phase spaces. We will discuss these situations as they 
are encountered; for now we incur no loss of generality if we take the phase 
space of our maps and vector fields to be open sets in R". 
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We will see in Chapter 7 that solutions of differential equations have 
different properties depending on whether or not the ordinary differential 
equation depends explicity on time. Ordinary differential equations that 
depend explicity on time (i.e., i = are referred to as nonau- 

tonomous or time dependent ordinary differential equations, or vector fields, 
and ordinary differential equations that do not depend explicity on time 
(i.e., X = f{x; fj,)) are referred to as autonomous or time independent ordi- 
nary differential equations, or vector fields. 

It will often prove useful to build a little more information into our 
notation for solutions, which we describe below. 

Dependence on Initial Conditions 

It may be useful to distinguish a solution curve by a particular point in 
phase space that it passes through at a specific time, i.e., for a solution x{t) 
we have x{to) = xq. We refer to this as specifying an initial condition. This 
is often included in the expression for a solution by writing x{t,to,xo). In 
some situations explicitly displaying the initial condition may be unimpor- 
tant, in which case we will denote the solution merely as x(t). In still other 
situations the initial time may be always understood to be a specific value, 
say to = 0; in this case we would denote the solution as x{t,xo)- 

Dependence on Parameters 

Similarly, it may be useful to explicitly display the parametric dependence 
of solutions. In this case we would write x{t,toiXo] p), or, if we weren’t 
interested in the initial condition, x(t; p). If parameters play no role in our 
arguments we will often omit any specific parameter dependence from the 
notation. 

Some Terminology 

1. There are several different terms which are somewhat synonymous 
with the term solution of (0.0.1). x{t,to,xo) may also be referred to 
as the trajectory or phase curve through the point xq at t = to. 

2. The graph of x{t, to, xo) over t is referred to as an integral curve. More 
precisely, graph x{t,to,xo) = { (a;, t) G K” x \ x = x{t,to,xo), t G 
I } where / is the time interval of existence. 

3. Let Xo be a point in the phase space of (0.1). By the orbit through xo, 
denoted 0(xq), we mean the set of points in phase space that lie on a 
trajectory passing through xq. More precisely, for xo & U C M", the 
orbit through xo is given by O{xo) = {x G K" \ x = x{t,to,xo), t G 
I }. Note that for any T G /, it follows that 0{x(T,to,xo)) = O{xo). 

Let us now give an example that illustrates the difference between trajec- 
tories, integral curves, and orbits. 
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Example 0.0.1. Consider the equation 

u = V, 

V = —u, (u,v) G X R^. (0.0.3) 

The solution passing through the point {u,v) — (1,0) at t = 0 is given by 
(u(t),v{t)j = (cost, — sint). The integral curve passing through (u, v) = (1, 0) at 

t = 0 is given by { (u,v,t) € R^ x R^ x R^ | (^u{t),v{t)J = (cost, — sin t), for all 

t € R }. The orbit passing through (u,v) = (1, 0) is given by the circle u^ + v^ — 1. 
Figure 0.0.1 gives a geometrical interpretation of these different definitions for 
this example. 

End of Example 0.0.1 



V 




(c) 



FIGURE 0.0.1. a) Solution through (1,0) at t = 0. b) Integral curve through 
(1, 0) at t = 0. c) Orbit of (1, 0). 

The astute reader will note that we have apparently gotten a bit ahead 
of ourselves in that we have tacitly assumed that (0.0.1) has solutions. 
Of course, this is by no means obvious, and apparently some conditions 
must be placed on /(x,t;/i) (as of yet, none have been stated) in order 
for solutions to exist. Moreover, additional properties of solutions, such as 
uniqueness and differentiability with respect to initial conditions and pa- 
rameters, are necessary in applications. When we explicitly consider these 
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questions in Chapter 7, we will see that these properties also are inherited 
from conditions on f{x,t; fj,). For now, we will merely state without proof 
that if f{x, t; fj,) is C’^ (r > 1) in x, t, and /r then solutions through any 
Xq € K" exist and are unique on some time interval. Moreover, the solu- 
tions themselves are C’’ functions of t, to, xq, and /t. (Note: recall that a 
function is said to be C*" if it is r times differentiable and each derivative 
is continuous; if r = 0 then the function is merely continuous.) 

At this stage we have said nothing about maps, i.e.. Equation (0.0.2). 
In a broad sense, we will study two types of maps depending on g{x; ^); 
noninvertible maps if g{x; g) as a function of x for fixed g has no inverse, 
and invertible maps if g(x] g) has an inverse. The map will be referred to 
as a C’’ diffeomorphism if g{x; g) is invertible, with the inverse denoted 
g), and both g{x', g) and g~^{x; g) are C’’ maps (recall that a map 
is invertible if it is one-to-one and onto). Our goal will be to study the 
orbits of (0.0.2), i.e., the bi-infinite (if g is invertible) sequences of points 



{■■■ ,9 '^{xo-,g),--- ,g ^{xo] g),xo,g{xo', g),- ■ ■ g'^{xo', g),- ■ (0.0.4) 

where xq G U and g” is defined inductively by 

g^{xo;g) = g{g"'~'^{xo;g)), n > 2, (0.0.5) 

g~^{xo;g) = g~^{g~”^+^{xo',g)), n > 2, (0.0.6) 

or the infinite (if g is noninvertible) sequences of points 

g{xo; g),-- ■, 9 ""{xo; g),-- •}, (0.0.7) 

where xq & U and g” is defined inductively by (0.0.5). (Note: it should 
be clear that we must assume g'^~^{xo] g), g~"'~^^ { xq] g) G U, n > 2, for 
(0.0.4) to make sense and g‘^~^{xo; g) G U, n > 2, for (0.0.7) to make sense.) 
Notice that questions of existence and uniqueness of orbits for maps are 
obvious and that differentiability of orbits with respect to initial conditions 
and parameters is a consequence of the applicability of the chain rule of 
elementary calculus. 

With these preliminaries out of the way, we can now turn to the main 
business of this book. 
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Equilibrium Solutions, 
Stability, and Linearized 
Stability 



1.1 Equilibria of Vector Fields 

Consider a general autonomous vector field 

X = f{x), X G K". (l-l-l) 

An equilibrium solution of (l-l-l) is a point x G K" such that 

f{x) = 0, 

i.e., a solution which does not change in time. Other terms often substituted 
for the term “equilibrium solution” are “fixed point,” “stationary point,” 
“rest point,” “singularity,” “critical point,” or “steady state.” In this book 
we will utilize the terms equilibrium point or fixed point exclusively. 

Example 1.1.1 (“Equilibria” in Nonautonomous Vector Fields). What about 
the notion of equilibria for nonautonomous vector fields? This is a situation where 
doing what “seems right” (i.e.. using ideas that have only been developed for 
autonomous vector fields) can lead to incorrect results. Let is describe this in 
more detail 

Consider a nonautonomous vector field 

X = f{x, t), X G R”. 

A common way of viewing this in applications is to view time as ’’frozen” and 
look at equilibria of the frozen time vector field (this is often done in fluid me- 
chanics where the vector field has the interpretation as the velocity field). These 
“instantaneous” fixed points are given by 

f{x,t) = 0. 

If we can find a point {x,t) such that f{x,t) = 0 and Dxf{x,t) yf 0, then by 
the implicit function theorem we can find a function x{t), with x(t) = x such 
that f(x(t),t) = 0, for t in some interval about t. However, these “frozen time 
equilibria” are not solutions of the nonautonomous vector field. In fact, it is easy 
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to see that if x{t) is a solution of the nonautonomous vector field, then it must 
be constant in time, i.e., x{t) = 0, see Exercise 11. 

The following example from Szeri et al. [1991] is quite instructive. Consider the 
one dimensional nonautonomous vector field: 

X = —X + t, (1-1-2) 

The solution through the point lo at t = 0 is given by 

x{t) = t — 1 + e~^ {xo + 1) , (1.1.3) 

from which it is clear that all solutions asymptotically approach the solution t—l 
as t ^ oo. 

The frozen time or “instantaneous” fixed points for (1.1.2) are given by 

X = t. (1-1-4) 

At a fixed t, this is the unique point where the vector field is zero. However, x = t 
is not a solution of (1.1.2). This is very different from the case of an autonomous 
vector field where a fixed point is a solution of the vector field. 

In Fig. 1.1.1 we plot some of the trajectories of (1.1.2). In particular we plot the 
curve of instantaneous fixed points. 




FIGURE 1.1.1. The trajectories of (1.1.2) plotted in x — t space. The curve of 
instantaneous fixed points is plotted as a dashed line and given hy x = t. 

In Fig. 1.1.2 we plot the “frozen time” vector field at some time t = i. In this 
figure we see something that seems somewhat counterintuitive. Trajectories to the 
right of the trajectory x{t) = t — 1 appear to be moving away from it according 
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to the direction of the instantaneous vector held, towards the instantaneous hxed 
point. However, we know from (1.1.3) that all trajectories decay to x{t) = t — 
1 at an exponential rate. What we are “seeing” in Fig. 1.1.2 is an artifact of 
drawing incorrect conclusions from instantaneous vector helds. Trajectories to 
the immediate right of a;(t) = t — 1 are indeed moving to the right (i.e., away 
from x{t) = t— 1). However, x{t) = t— 1 is moving to the right at a faster speed 
and it eventually overtakes these trajectories. Fig. 1.1.2 might also lead us to 
believe that trajectories converge to the instantaneous hxed point. But we know 
this is not true since we have the exact solutions. 



X 

x=f-l x=T 

FIGURE 1.1.2. The “frozen time” vector held. 

This example shows that it can be misleading and lead to incorrect conclusions 
if we try to think of nonautonomous vector helds as a “sequence” of frozen time 
autonomous vector helds. 



End of Example 1.1.1 



Once we find any solution of (1.1.1) it is natural to try to determine if 
the solution is stable. 



1.2 Stability of Trajectories 

Let x{t) be any solution of (1.1.1). Then, roughly speaking, x{t) is stable 
if solutions starting “close” to x{t) at a given time remain close to x{t) for 
all later times. It is asymptotically stable if nearby solutions not only stay 
close, but also converge to x{t) as t 1 oo. Let us formalize these ideas. 

Definition 1.2.1 (Liapnnov Stability.) x{t) is said to be stable (or 
Liapunov stable) if, given e > 0, there exists a S = S{e) > 0 such that, for 
any other solution, y(t), of (1.1.1) satisfying |l:(to) ~ 2/(^o)| < ^ (where \ ■ \ 
is a norm on K"), then \x{t) — y{t)\ < e for t > to, to S K. 

We remark that a solution which is not stable is said to be unstable. 

Definition 1.2.2 (Asymptotic Stability) x{t) is said to be asymptot- 
ically stable if it is Liapunov stable and for any other solution, y{f), of 
(1.1.1), there exists a constant b > 0 such that, if |a;(to) — y(to)\ < b, then 
lim \x{t) - y{t) \ = 0. 
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See Figure 1.2.1 for a geometrical interpretation of these two definitions. 
Notice that these two definitions imply that we have information on the 





FIGURE 1.2.1. a) Liapunov stability, b) Asymptotic stability. 

infinite time existence of solutions. This is obvious for equilibrium solutions 
but is not necessarily so for nearby solutions. Also, these definitions are for 
autonomous systems, since in the nonautonomous case it may be that S 
and b depend explicitly on to (more about this later). 

Definition 1.2.2 may seem a bit strange in that we require stability in 
addition to the requirement that trajectories approach the solution as 
t — >■ 00 . One might think that the latter condition implies stability. 
However, this is not the case as is illustrated in the two phase portraits 
in Figure 1.2.2. 

Both phase portraits show an equilibrium with the property that so- 
lutions in any arbitrarily small neighborhood leave the neighborhood. 
However, the global topology of the trajectories is such that all tra- 
jectories in a neighborhood asymptotically approach the equilibrium as 
t — >■ 00 . Thus, solutions can attract & neighborhood, but not be Liapunov 
stable. 

Thus far we have discussed stability of trajectories. Now we define a 
slightly different, but important, notion of stability that will be generalized 
later on-orbital stability (recall the distinction between “trajectories” and 
“orbits” given in the introduction). 

First, we define the positive orbit through the point Xq for t > to as: 

{xq, to) = {a: G R” I a; = x{t), t > to, x{to) = xq} . (1-2-1) 

Next we need the notion of the distance between a point and a set. This 
is defined as follows. Let S G K" be an arbitrary set and p G R” be an 
arbitrary point. Then the distance between the point p and the set S is 
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(a) 




FIGURE 1.2.2. Phase portraits a) and b) illustrate the situation of an equilibrium 
point that is unstable, but all trajectories in a neighborhood of the equilibrium 
point approach it as t — >■ oo. 

denoted and defined as: 



d{p, S) = inf \p — x\. (1-2-2) 

xes 

Now we can state the following definition. 

Definition 1.2.3 (Orbital Stability) x{t) is said to be orbitally stable 
if, given e > 0, there exists a S = S{e) > 0 such that, for any other solution, 
y{f), of (1.1.1) satisfying |x(to) - y{to)\ < S, then d (y(t), 0+(xo, to)) < £ 
for t > to. 

We can also now define asymptotic orbital stability. 

Definition 1.2.4 (Asymptotic Orbital Stability) x{t) is said to be 
asymptotically orbitally stable if it is orbitally stable and for any other 
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solution, y{t), of (1.1.1), there exists a constant b > 0 such that, if |a;(io) — 
y{to) \ <b, then lim d (y{t),0''~{xo,to)) =0. 

Definitions 1.2.3 and 1.2.4 are stated in terms of orbital stability of a tra- 
jectory. In practice, variations in the terminology may arise. In particular, 
one could just as easily phrase these definitions in terms of the stability of 
the orbit (generated by the trajectory), rather than the trajectory. This is 
an example of where the distinction between “orbit” and “trajectory” is 
slightly blurred. 

Definitions 1.2.1, 1.2.2, 1.2.3, 1.2.4 describe mathematically different 
types of stability; however, they do not provide us with a method for deter- 
mining whether or not a given solution is stable. We now turn our attention 
to this question. 



1.2 a Linearization 

In order to determine the stability of x(t) we must understand the nature 
of solutions near x{t). Let 

X = x(f) -b y. (1.2.3) 

Substituting (1.2.3) into (1.1.1) and Taylor expanding about x{f) gives 

X = x{f) +y= f{x{t)) + Df{x{t))y + 0{\y\^), (1.2.4) 

where Df is the derivative of / and | • | denotes a norm on K" (note: in 
order to obtain (1.2.4) / must be at least twice differentiable). Using the 
fact that x{f) = /(x(t)), (1.2.4) becomes 

y = Df{x{t))y + 0{\y\^). (1.2.5) 

Equation (1.2.5) describes the evolution of orbits near x{t). For stability 
questions we are concerned with the behavior of solutions arbitrarily close 
to x{t), so it seems reasonable that this question could be answered by 
studying the associated linear system 

y = Df{x{t))y. (1.2.6) 

Therefore, the question of stability of x{t) involves the following two steps: 

1. Determine if the y = 0 solution of (1.2.6) is stable. 

2. Show that stability (or instability) of the y = 0 solution of (1.2.6) 
implies stability (or instability) of x{t). 

Step 1 may be equally as difficult as our original problem, since there are 
no general analytical methods for finding the solution of linear ordinary 
differential equations with time-dependent coefficients. However, if x{t) is 
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an equilibrium solution, i.e., x{t) = x, then Df{x{t)) = Df{x) is a matrix 
with constant entries, and the solution of (1.1.6) through the point yo G K” 
of t = 0 can immediately be written as 

y(t) = (1.2.7) 

Thus, y(t) is asymptotically stable if all eigenvalues of Df{x) have negative 
real parts (cf. Exercise 7). 

The answer to Step 2 can be obtained from the following theorem. 

Theorem 1.2.5 Suppose all of the eigenvalues of Df{x) have negative 
real parts. Then the equilibrium solution x = x of the nonlinear vector field 
(1.1.1) is asymptotically stable. 

Proof: We will give the proof of this theorem in Chapter 2 when we discuss 
Liapunov functions. □ 



Example 1.2.1 (Stability and Eigenvalues of Time-Dependent Jacobians). For 
a general time dependent solution x{t) it might be tempting to infer stability 
properties of this solution from the eigenvalues of the Jacobian Df(x{t)). The 
following example from Hale [1980] shows this can lead to wrong answers. 
Consider the following linear vector field with time-periodic coefficients 



Xl 

X2 



= m 




where 



A{t) = 



— 1 -I- I cos^ t 



1 — I cos t sin t 
— 1 — I cos t sin t — 1 -f I sin^ t 



( 1 . 2 . 8 ) 



The eigenvalues of A{t) are found to be independent of t and are given by 



Ai(t) 



-l + i^ 
4 



\2{t) 



4 



In particular, they have negative real parts for all t. However, one can verify that 
the following are two linearly independent solutions of this equation 



Vl{t) = 



/ — cos t \ 
y sin t J 



e^ 



V2(t) = 



/ sint 
\cost 



-t 



e 



(1.2.9) 



Hence, the solutions are unstable and of saddle type, a conclusion that does not 
follow from the eigenvalues of A{t). 



End of Example 1.2.1 
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Example 1.2.2. In this example we show that stability in the linear approx- 
imation does not necessarily imply stability. Consider the following vector field 
on 



1 / 2 , 2x 

x = -y + x{x +y ), 

y = x + y{x^ + y^). ( 1 . 2 . 10 ) 

The origin is an equilibrium point for this equation, and the vector field linearized 
about this equilibrium is given by 



X = -y, 

y = x. ( 1 . 2 . 11 ) 



The eigenvalues of the matrix associated with this linearization are ±i, and the 
origin is stable in the linear approximation (but not asymptotically stable). 

Next we examine nonlinear stability. We transform (1.2.10) to polar coordi- 
nates using 

x = rcos9, y = rsin9, 

to obtain 

3 

r = r , 

9 = 1. 



Since r > 0 we see that r is increasing, indicating that solutions spiral away from 
the origin. 



End of Example 1.2.2 

Sometimes the term “linearly stable” is used to describe a solution that 
is stable in the linear approximation. Thus, linearly stable solutions may 
be nonlinearly unstable. 

In the following sections the reader will see many results that have a 
similar flavor to Theorem 1.2.5. Namely, if the eigenvalues of the associated 
linear vector held have nonzero real parts, then the orbit structure near an 
equilibrium solution of the nonlinear vector held is essentially the same as 
that of the linear vector held. Such equilibrium solutions are given a special 
name. 

Definition 1.2.6 (Hyperbolic Fixed Point) Let x = x be a fixed point 
of i = f{x), X G K". Then x is called a hyperbolic fixed point if none of 
the eigenvalues of Df(x) have zero real part. 

It should be noted that the notion of “hyperbolicity of a fixed point” is 
defined in terms of the linearization about the fixed point. The notion of 
hyperbolicity extends to general trajectories, as well as to invariant sets and 
manifolds. In all of these cases hyperbolicity will also be defined in terms of 
the linearization about the trajectory, invariant set, or invariant manifold. 
Moreover, we will learn that hyperbolicity “persists under perturbations” . 
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Historically, hyperbolicity has been a central concept in the development 
of dynamical systems theory. 

It should be clear that in our studies of stability of equilibria in the linear 
approximation the nature of the linearized stability will boil down to a 
study of the nature of the roots of the characteristic polynomial of the 
matrix associated with the linearization about the equilibrium point of 
interest. Here we collect together a few very useful results about the 
roots of polynomials. 

Consider a polynomial with real coefficients of the form: 

p(A) = cqA -|- uiA + • • • + an—i\ + Un, t^i M, no 7^ 0. (1.2.12) 

Theorem 1.2.7 (Fundamental Theorem of Algebra) (1.2.12) has exactly 
n real or complex roots, Ai,...,A„, where repetition of roots is possible, i.e., 
Ai = Aj, for some i and j. 

Since we are considering the case of polynomials with real coeffieients, 
it is easy to verify that if A is a root of (1.2.12) then so is the complex 
conjugate of A, A (just substitute the root A into (1.2.12) and take the 
complex conjugate of the result, using the fact that the coefficients are 
real). Hence, for polynomials with real coefficients the roots occur in 
complex conjugate pairs. 

Next we describe a useful result that enables to get some information 
about the roots of polynomials just by “looking at” the coefficients. 

Theorem 1.2.8 (Descartes’ Rule of Signs) Consider the sequence of coefi- 
cients of (1.2.12): 

Ojn 1 ^n — \ 1 ■ ■ ■ ? CE-i , CLq . 

Let k be the total number of sign changes from one coefficient to the next in the 
sequence. Then the number of positive real roots of the polynomial is either equal 
to k, or k minus a positive even integer. (Note: if k = 1 then there is exactly one 
positive real root.) 

Example 1.2.3. Consider the polynomial: 

A^ - 2A + 1 = 0. 

There are two sign changes in the sequence of coefficients, and the roots are 1 
and 1. 



End of Example 1.2.3 

Proofs of the fundamental theorem of algebra and Descartes’ rule of 
signs can be found in many algebra textbooks. 

We are interested in the location in the complex plane of the real parts 
of the roots of the polynomial (1.2.12). The famous Routh-Hurwitz cri- 
terion can be very useful for this purpose. 
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First we construct the Routh table associated with the polynomial 
(1.2.12). This is given by: 



Oq 


02 


04 


06 


ai 


03 


05 


07 


f3,l 


rs ,2 


rs.s 


rs,4 


^4,1 


U4,2 


^4,3 


V4,4 


'ti+1,1 


'f'n+1,2 


rn + 1,3 


^n-t-1,4 



where 

(ri,i Ti ^2 ■ ■ ■) = {ri- 2,2 Vi-2,3 ■■■) - (y.._ (* > 2). 

ri-1,1 

(1.2.14) 

(The notation rij stands for row i, column j.) Note that rows three and 
higher may not contain the same number of entries as rows one or two. 

This will be seen in an example below. Now we state the following test. 



Theorem 1.2.9 (Routh-Hurwitz Test) All of the roots of the polynomial 
(1.2.12) have real parts strictly less than zero if and only if all n + 1 elements in 
the first column of the Routh table are nonzero and have the same sign. 



An elementary proof of the Routh-Hurwitz criterion can be found in 
Meinsma [1995]. A comprehensive reference is Gantmacher [1989], which 
also covers certain “singular” cases not covered by the result stated here 
(the so-called “regular case”). 



Example 1.2.4. 

Consider the polynomial: 

A®-t6AVllA-b6 = 0. (1.2.15) 



The associated Routh table is given by: 



1 11 
6 6 
10 
6 



Hence, all of the roots of this polynomial lie in the left half plane (they are — 1, 
—2, and —3). 



End of Example 1.2.4 
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Everything discussed thus far applies also for maps; we mention some of 
the details explicitly. 

Consider a O'" (r > 1) map 

X I— >■ g{x), X G K”, (1.3.1) 

and suppose that it has a fixed point at a; = a;, i.e., x = g{x). The associated 
linear map is given by 



y^Ay, yG R”, 



(1.3.2) 



where A = Dg(x). 



1.3a Definitions of Stability for Maps 

The definitions of stability and asymptotic stability for orbits of maps are 
very similar to the definitions for vector fields. We leave it as an exercise 
for the reader to formulate these definitions (cf. Exercise 4). 

1.3b Stability of Fixed Points of Linear Maps 

Choose a point ?/o G The orbit of y^ under the linear map (1.3.2) is 
given by the bi-infinite sequence (if the map is a C’’, r > 1, diffeomorphism) 

{• • • , A“”yo, • • • , A~^yQ, yo, Ay^, ■■■ , A”i/o, • • •} (1.3.3) 

or the infinite sequence (if the map is C, r > 1, but noninvertible) 



{yo,Ayo,--- ,A^yo,---}. (1.3.4) 

From (1.3.3) and (1.3.4) it should be clear the fixed point y = 0 of the 
linear map (1.3.2) is asymptotically stable if all of the eigenvalues of A 
have moduli strictly less than one (cf. Exercise 9). 



1.3c Stability of Fixed Points of Maps via the 
Linear Approximation 

With the obvious modifications, Theorem 1.2.5 is valid for maps. 

Before we apply these ideas to the unforced Duffing oscillator, let us first 
give some useful terminology. 
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1.4 Some Terminology Associated with Fixed 
Points 

A hyperbolic fixed point of a vector field (resp., map) is called a saddle if 
some, but not all, of the eigenvalues of the associated linearization have real 
parts greater than zero (resp., moduli greater than one) and the rest of the 
eigenvalues have real parts less than zero (resp., moduli less than one). If 
all of the eigenvalues have negative real part (resp., moduli less than one), 
then the hyperbolic fixed point is called a stable node or sink, and if all of 
the eigenvalues have positive real parts (resp., moduli greater than one), 
then the hyperbolic fixed point is called an unstable node or source. If the 
eigenvalues are purely imaginary (resp., have modulus one) and nonzero 
(resp., are not real), the nonhyperbolic fixed point is said to be a center 
(resp., is said to be elliptic). 

Let us now apply our results to the unforced Duffing oscillator. 

1.5 Application to the Unforced Duffing Oscillator 

The unforced Duffing oscillator is given by 
x = y, 

y = X — — Sy, i5 > 0. 

It is easy to see that this equation has three fixed points given by 

(cr, 2 /) = (0,0),(±I,0). (I.5.I) 

The matrix associated with the linearized vector field is given by 

(l-U 1)- (1-5.2) 

Using (1.5.1) and (1.5.2) the eigenvalues Ai and A 2 associated with the 
fixed point (0, 0) are given by Ai _2 = — 5/2± + 4, and the eigenvalues 

associated with the fixed points (±1, 0) are the same for each point and are 
given by Ai ,2 = —6/2 ± 6“^ — 8. Hence, for <5 > 0, (0,0) is unstable and 

(±1, 0) are asymptotically stable; for <5 = 0, (±1, 0) are stable in the linear 
approximation . 

1.6 Exercises 

1. Consider the following vector fields. 



. X — y, 
a ) . ^ j. 

y = -dy - yx 



(x,y) G RL 
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b) 



c) 

d) 



e) 



f) 



h) 



X — 

y = 

X — 

y = 

X = 

y = 



y. 

—5y — y,x — x ^ , 

y, 

— <5y — fix — , 

— 5x — fiy xy, 
y,x — 6y \ {x‘^ 



(x,y) G R^. 

(x, y) G 

(x, y) G R^. 



X — —X + x^ , 

y = x + y, 

r — r(l — r^), 
0 — cos 40, 



(x,y) G R^. 

(r, 0) G R’*' X 



r — r{S + — r^), 

9 = l-r^, 

9 = v, 

i} = — sin 9 — 5v + 



(r,9) G R+ X 
(9,v) G X R. 



i) 



92 = 1^2 + , n > 1, 



( 01 , 62 ) e X S\ 



j) 

k) 



01 — 02 — sin 01 , 

02 — —02i 



Bi - Bl, 
02 — 2 , 



{61,62) G X S" 
(61,62) G X S\ 



Find all fixed points and discuss their stability. 
2. Consider the following maps. 



b) 



X X, 

y ^ X 4 -y, 

X x^ , 

y ^ X 4 -y, 

01 I-4- 01 , 

02 4 - 02 , 



{x, y) G IR . 

(x, y) e R^. 

(61,62) G S" X S\ 



d) ( 6 i, 62 )^S-xS\ 



e) 



2xy 
x + y 
/ 2xy 

V ^+y 



2 n 1/2 (x,y) G 1 



h) 



y ^ (xy) ' , 

(x, y) G R^. 

+ , 0,1 (6,v) G X r\ 

dv — p.cos(y + d), ' ' 



X fi — 6y — x^ , 
y X, 

0 ^ 0 + v, 



Find all the fixed points and discuss their stability. 

3. Consider a (r > 1) diffeomorphism 

X 1 - 4 - f{x ) , x G . 

Suppose / has a hyperbolic periodic orbit of period k. Denote the orbit by 



0(p) = {p,/(p),/^(p), •••,/*’ ^(p)./'‘(p) =p}. 

Show that stability of 0 (p) is determined by the linear map 

y M- Df'‘(f(p))y 

for any j = 0, 1, ■ ■ ■ , k— 1. Does the same result hold for periodic orbits of noninvertible 
maps? 
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4. Formulate the definitions of Liapunov stability and asymptotic stability for maps. 

5. Show that hyperbolic fixed points of maps which are asymptotically stable in the linear 
approximation are nonlinearly asymptotically stable. 

6. Give examples of fixed points of vector fields and maps that are stable in the linear 
approximation but are nonlinearly unstable. 

7. Consider the linear vector field 

X — Ax, X G , 

where A is an n X n constant matrix. Suppose all the eigenvalues of A have negative 
real parts. Then prove that a: = 0 is an asymptotically stable fixed point for this linear 
vector field. {Hint: utilize a linear transformation of the coordinates which transforms 
A into Jordan canonical form.) 

8. Suppose that the matrix A in Exercise 7 has some eigenvalues with zero real parts 
(and the rest have negative real parts). Does it follow that a? = 0 is stable? Answer 
this question by considering the following example. 

(»)-(s ;)(»)■ 

9. Consider the linear map 

X 1-^ Ax, X G , 

where A is an n X n constant matrix. Suppose all of the eigenvalues of A have modulus 
less than one. Then prove that a: = 0 is an asymptotically stable fixed point for this 
linear map (use the same hint given for Exercise 7). 

10. Suppose that the matrix A in Exercise 9 has some eigenvalues having modulus one 
(with the rest having modulus less than one). Does it follow that rc = 0 is stable? 
Answer this question by considering the following example. 




11. Consider a nonautonomous vector field 

X = f{x, t), X G K”', 

and suppose that x{i) is a function (defined for some interval of t) satisfying 

= 0 . 

Prove that if x{t) is a trajectory of the vector field then it must be constant in time. 

12. Consider the following vector field (Yang [2001]): 

X — —X, 

- 1, 

e = uj, {x,<p,e) e Rx X 

where cj is an irrational number. Show that every trajectory is asymptotically orbitally 
stable. 

13. Consider the following vector field (Yang [2001]): 

9 ^ sin^ ^ + (1 - rf, 
r — r(l — r), {9, r) G 5^ X R. 



Show that every trajectory, except r = 0, is asymptotically orbitally stable. 
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14. Does Descartes’ rule of signs provide any information about the roots of the polynomial 
p( — A), where p(A) is given by (1.2.12)? 

15. Use the Routh-Hurwitz test to determine the location of the roots of the following 
polynomials: 

(a) - 3A^ + 3A - 1, 

(b) A^ + 3A^ - 4, 

(c) A^ + A^ + A + 1. 
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Liapunov Functions 



The method of Liapunov can often be used to determine the stability of 
fixed points when the information obtained from linearization is incon- 
clusive (i.e., when the fixed point is nonhyperbolic) . Liapunov theory is a 
large area, and we will examine only an extremely small part of it; for more 
information, see Lasalle and Lefschetz [1961]^. 

The basic idea of the method is as follows (the method works in n- 
dimensions and also infinite dimensions, but for the moment we will de- 
scribe it pictorally in the plane). Suppose you have a vector field in the 
plane with a fixed point x, and you want to determine whether or not it is 
stable. Roughly speaking, according to our previous definitions of stability 
it would be sufficient to find a neighborhood U of x for which orbits starting 
in U remain in U for all positive times (for the moment we don’t distinguish 
between stability and asymptotic stability). This condition would be satis- 
fied if we could show that the vector field is either tangent to the boundary 
of U or pointing inward toward x (see Figure 2.0.1). This situation should 
remain true even as we shrink U down onto x. Now, Liapunov’s method 
gives us a way of making this precise; we will show this for vector fields in 
the plane and then generalize our results to M". 

Suppose we have the vector field 

i = f{x,y), 

y = g{x,y), {x,y)&R^, ( 2 . 0 . 1 ) 

which has a fixed point at {x,y) (assume it is stable). We want to show 
that in any neighborhood of {x, y) the above situation holds. Let V{x,y) 
be a scalar-valued function on i.e., — >■ (and at least C^), with 



^First, something should be said about the spelling of the name “Liapunov”, 
also spelled as “Lyapunov” , and , “Liapounoff” . The book of Lasalle and Lefschetz 
[1961] uses “Liapunov”. Hirsch and Smale [1974] use “Liapunov”, while Arnold 
[ 1973 ] uses “Lyapunov” . In the setting of “Lyapunov exponents” the “y” is more 
typically used (although the influential paper of Eckmann and Ruelle [1985] uses 
“i”). A random check of the journal Systems & Control Letters over the past few 
years found both “Liapunov” and “Lyapunov” both used about the same number 
of times. For this reason many references in the bibliography of this book will 
have different spellings of “L • apunov”. From time to time we will also drift 
between different spellings when a particular spelling is more usually found in 
the area under discussion. 
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FIGURE 2.0.1. The vector field on the boundary of U. 

V{x,y) = 0, and such that the locus of points satisfying V{x,y) = C = 
constant form closed curves for different values of C encircling (T, y) with 
V{x,y) > 0 in a neighborhood of (x,y) (see Figure 2.0.2). 




FIGURE 2.0.2. Level set of V and VU denoted at various points on the boundary. 

Now recall that the gradient of V, W, is a vector perpendicular to the 
tangent vector along each curve V = C which points in the direction of 
increasing V (see Figure 2.0.3). So if the vector field were always to be either 
tangent or pointing inward for each of these curves surrounding (x,y), we 
would have 



\/V{x,y) ■ {x,y) < 0, 
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FIGURE 2.0.3. Level sets of U, 0 < Gi < G 2 < G 3 . 

where the “dot” represents the usual vector scalar product. (This is simply 
the derivative of V along orbits of (2.0.1), and is sometimes referred to 
as the orbital derivative.) We now state the general theorem which makes 
these ideas precise. 

Theorem 2.0.1 Consider the following vector field 

X = f{x), X e K". (2.0.2) 

Let X he a fixed point of (2.0.2) and letV'.U be a function defined 
on some neighborhood U of x such that 

i) V{x) =0 and V{x) >0 if x ^ x. 

ii) V{x) <0 in U — {x}. 

Then x is stable. Moreover, if 

iii) V{x) <0 in U — {x} 
then X is asymptotically stable. 

Proof: Consider a ball centered at x of radius 5, i.e., 

Bs{x) = {a; G M" | |a; — ai| < <5} , 

where S is chosen so small that Bs{x) C U. Let m be the minimum value 
of V on the boundary of Bs(x). Then by i), m > 0. Then let 

Ui = {x & Bs{x)\V (x) < m } , 

see Fig. 2.0.4. Now consider any trajectory starting in U\. By ii), on such 
a trajectory V is non-increasing. Hence by our construction the trajectory 
cannot leave Bs{x). This proves that x is stable since S is arbitrary. 
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Now suppose that iii) holds, so that V is strictly decreasing on orbits in 
U — {x}. Let x{t) be a trajectory starting in Ui — {x}. Then since Bs{x) is 
compact, and passing to a subsequence if necessary, we can find a sequence 
of times {t„}, with — >■ oo as n — >■ oo such that x(t„) converges to a point 

xo as n — >■ oo. We now argue that xq = x. 

This can be seen as follows. We will give a proof by contradiction. Assume 
that Xo yf x. Then there exists an £ sufficiently small such that xq ^ (x). 

Repeating the same argument given above, one can then conclude that 
there exists a neighborhood of x, IJi C B^ (x), such that any trajectory 
starting in Ui cannot leave B^ (x), see Fig. 2.0.4. 




FIGURE 2.0.4. Geometry associated with the choice of neighborhoods in the 
proof of Theorem 2.0.1. 

From this it follows that the trajectory x(t) cannot enter U\. Then, in 
U\ — U\, V is strictly bounded away from zero, i.e., we have the following 
estimate: 

V < —K < 0, for some K > 0. 

Since x(t) cannot enter Ui we can apply this estimate along the trajectory 
x(t) to obtain the following inequality: 

V (x{tn)) - V {x{0)) = f V{x{s))dt, 

JO 

< -Kt„, 
or 

V{x{tn)) <V{x{0))-Ktn. 
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Now as n — >■ oo this inequality imples that V (x(tn)) must become negative. 
This is a contradiction, which came about by assuming xq ^ x. Therefore 
X = Xo- □ 

We remark that there is a slight gap in the proof of this theorem in that 
we have not proved that the trajectory x(t) exists on the semi-infinite time 
interval t G [0,oo). Indeed, we have not considered existence of solutions 
at all at this point in the book. Nevertheless, this fact is indeed true under 
our assumptions, see Theorem 7.2.1. 

We refer to as a Liapunov function. If V < 0 on U — {x} the term 
strict Liapunov function is often used. We remark that if U can be chosen 
to be all of K", then x is said to be globally asymptotically stable, if i) and 
iii) hold. 

Example 2.0.1. Consider the following vector field 

x = y, (2.0.3) 

y = —X ex^y. (2.0.4) 

It is easy to verify that (2.0.3) has a nonhyperbolic fixed point at {x, y) = (0, 0). 
Our goal is to determine if this fixed point is stable. 

Let V{x,y) = {x^ + y^')l‘l. Clearly E(0,0) = 0 and V{x,y) > 0 in any neigh- 
borhood of (0,0). Then 

V{x,y) = W{x,y) ■ {x,y) 

= (x,y) ■ (y,ex^y - x) 

= xy + ex^ y^ — xy 

and hence V = ex^y'^. Then, by Theorem 2.0.1, (0,0) is globally stable for e < 0. 
Actually, with a little more work, one can show that (0, 0) is globally asymptot- 
ically stable for e < 0 



End of Example 2.0.1 



Let us now use Liapunov theory to give an outline of the proof of Theorem 
1.2.5. We begin by recalling the set-up of the problem. 

Consider the vector field 

X = f{x), X G M", (2.0.5) 

and suppose that (2.0.5) has a fixed point at x = x, i.e., f{x) = 0. We 
translate the fixed point to the origin via the coordinate shift y = x — x so 
that (1.1.17) becomes 

y=f{y + x), j/GK”. (2.0.6) 

Taylor expanding (2.0.6) about x gives 



y = Df{x)y + R{y), 



(2.0.7) 




where R{y) = 0(|yp). 

Now let us introduce the coordinate rescaling 
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y = £u, 0 < e < 1. (2.0.8) 

Thus, taking £ small implies making y small. Under (2.0.8) equation (2.0.7) 
becomes 

ii = Df{x)u + R{u, e), (2.0.9) 

where R{u,e) = R{eu)/e. It should be clear that R{u,0) = 0 since R{y) = 
0(|yp). We choose as a Liapunov function 

V{u) = 

Therefore, 

V{u) = W(m) • ii 

= [u ■ Df{x)u) + (u ■ R{u,e)). (2.0.10) 

From linear algebra the reader should recall that if all eigenvalues of Df{xo) 
have negative real part, then there exists a basis such that 

[u ■ Df{xo)u) < k\u\^ < 0 (2.0.11) 

for some real number k and for all u (see Arnold [1973] or Hirsch and 
Smale [1974] for a proof). Hence, by choosing e sufficiently small, (2.0.10) is 
strictly negative, which implies that the fixed point a; = a; is asymptotically 
stable. We leave it to the reader to show that this result does not depend 
on the particular basis for which (2.0.11) holds. This latter point, while 
sounding simple in practice, is much more tricky than one might believe, 
see Arnold [1973] or Hirsch and Smale [1974] for details. 



2.1 Exercises 

1. Suppose X — x{t) is a solution of the nonautonomous equation 

X = f{x, t), X G M””. 

Show that this solution can be transformed to the “zero solution” by the shift 

X — y -h ^(t). 

Hence, conclude that without loss of generality, the study of the stability of an arbitrary 
solution can be transformed to a study of the stability of the solution x = 0, even for 
time-dependent vector fields. 

2. In this exercise we generalize Liapunov’s theorem to nonautonomous systems. Consider 
a C’’, n > 1, vector field 

x — f(x,t)^ (2.1.1) 



satisfying 



/(o.i) = 0. 
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Definition 2.1.1 A function V{x,t) is called positive definite in a region U C if there 
exists a function W {x) with the following properties: 

1. W is defined and continuous in U. 

2. 0 < W (a:) < V {x, t) for x ^ 0 and t > fg ■ 

The derivative of V along trajectories of (2.1.1) is given by 

dV 

V = + W • X 

dt 

dV 

= +VV ■ 

This is the orbital derivative for nonautonomous systems. Prove the following theorem. 

Theorem 2.1.2 Suppose V{x,t) is positive definite in a neighborhood U of x — 0 for 
t > tQ. Then 

i) '^/ ^ (^) t) < 0 m U . 

X — 0 is stable. Moreover, if 

ii) V {x, t) < 0 in U 

then X — 0 is asymptotically stable. 

Hint: Since in the definition of positive definite W{x) is independent oft, you should be 
able to mimic the proof of Theorem 2.0.1. For more information on Liapunov’s method for 
nonautonomous systems see Aeyels [1995]. 

3. Prove Dirichlet’s theorem (Siegel and Moser [1971]). Consider a vector field (r > 1) 

X — f{x), X G M”", 

which has a fixed point at x = x. Let H{x) be a first integral of this vector field defined 
in a neighborhood oi x — x such that x — x \s a nondegenerate minimum of H{x). 
Then x = x is stable. 

4. Prove Liapunov’s theorem for maps, i.e., consider a diffeomorphism 

X 1-4- f{x) , X G M”" , 

and suppose that we have a scalar- valued function 

V: [/ -> 

defined on some open set U G K”" satisfying 

i) V(xo) = 0; 

ii) V {x) > 0 for x ^ xq; 

iii) V o f{x) < V (x) with equality if and only \i x — xq. 

Then rr = is a stable fixed point. Moreover, if strict inequality holds in iii), then a: = a^g is 
asymptotically stable. Does the same result hold for noninvertible maps? 

5. Consider the vector field 

X = —y — x(x + V ), 
y = X - y{x'^ + y'^), (a;,y)eR^. 

Use Liapunov’s method to show that the origin is globally asymptotically stable. 
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6. Consider the damped Duffing equation 



X = y, 

y = X — — Sy, (ai, y) G <5 > 0. 



Use the function 



V{x, y) 



y_ 

2 




as a Liapunov function to show that the equilibrium points (x, y) — (il, 0) are asymp- 
totically stable. 



7. Consider a particle of mass m moving in under the influence of a potential fleld 
y, z). The equations of motion are given by 



, 

mx — \x,y^z), 

dx 

my = -^{x,y,z), 

oy 



mz = ~ — {x,y,z), 
oz 



Prove that a minimum of the potential corresponds to a stable equilbrium point. Is it 
asymptotically stable? How differentiable must y, 2 ) be? 

8. Prove the following instability theorem. Let x — x he axi equilibrium point of a 

r > 1, vector fleld x — f{x), x G R”". Suppose V (x) is a scalar valued function 
satisfying V {x) — 0 and V > 0 in U — {s}, where t/ is a neighborhood of x. If 
U(a:n) > 0 for some sequence Xn x^ then x is unstable. 
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Invariant Manifolds: Linear and 
Nonlinear Systems 



We will see throughout this book that invariant manifolds, in particular 
stable, unstable, and center manifolds, play a central role in the analysis of 
dynamical systems. We will give a simultaneous discussion of these ideas 
for both vector fields 

X = /(x), X G M", (3.0.1) 

and maps 

X I— >■ g{x), X G K”. (3.0.2) 

Definition 3.0.3 (Invariant Set) Let S C K" be a set, then 

a) (Continuous time) S is said to be invariant under the vector field 
X = f{x) if for any xq € S we have x(t, 0 ,xq) G S for all t & M. 
(where x(0, 0, xg) = xg). 

b) (Discrete time) S is said to be invariant under the map x i— >■ g{x) if 
for any Xg € S we have g’^(xg) G S for all n. 

If we restrict ourselves to positive times (i.e., t > 0, n > 0) then we refer 
to S as a positively invariant set and, for negative time, as a negatively 
invariant set. 

Stated succinctly, invariant sets have the property that trajectories start- 
ing in the invariant set, remain in the invariant set, for all of their future, 
and all of their past. 

We remark that if g is noninvertible, then only n > 0 makes sense (al- 
though in some instances it may be useful to consider g~^ which does have 
a set theoretic meaning) . 

Definition 3.0.4 (Invariant Manifold) An invariant set S C K” is said 
to be a C’’ (r > 1) invariant manifold if S has the structure of a C’’ 
differentiable manifold. Similarly, a positively (resp., negatively) invariant 
set S C M" is said to be a (r > 1) positively (resp., negatively^ invariant 
manifold if S has the structure of a C’’ differentiable manifold. 

Evidently, we need to say what we mean by the term “C’’ differentiable 
manifold.” However, this is the subject of a course in itself, so rather than 
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define the concept of a manifold in its full generality, we will describe only 
that portion of the vast theory that we will need. 

Roughly speaking, a manifold is a set which locally has the structure 
of Euclidean space. In applications, manifolds are most often met as m- 
dimensional surfaces embedded in R". If the surface has no singular points, 
i.e., the derivative of the function representing the surface has maximal 
rank, then by the implicit function theorem it can locally be represented 
as a graph. The surface is a C’’ manifold if the (local) graphs representing 
it are C’’ (note: for a thorough treatment of this particular representation 
of a manifold see Dubrovin, Fomenko, and Novikov [1985]). 

Another example is even more basic. Let {si, • • • , s„} denote the stan- 
dard basis on M”. Let , • • • , }, j < n, denote any j basis vectors from 

this set; then the span of {sjj, • • • , forms a j-dimensional subspace of 
M" which is trivially a C°° j-dimensional manifold. For a thorough intro- 
duction to the theory of manifolds with a view to applications see Abraham, 
Marsden, and Ratiu [1988]. 

The main reason for choosing these examples is that, in this book, when 
the term “manifold” is used, it will be sufficient to think of one of the 
following two situations: 

1. Linear Settings: a linear vector subspace of K"; 

2. Nonlinear Settings: a surface embedded in R" which can be locally 
represented as a graph (which can be justified via the implicit function 
theorem) . 

3.1 Stable, Unstable, and Center Subspaces of 
Linear, Autonomous Vector Fields 

Let us return to our study of the orbit structure near fixed points to see 
how some important invariant manifolds arise. We begin with vector fields. 
Let a; G R” be a fixed point of 

X = f{x), X G R". (3.1.1) 

Then, by the discussion in Chapter 1, it is natural to consider the associated 
linear system 

y = Ay, 2 /gR”, (3.1.2) 

where A = Df{x) is a constant nxn matrix. The solution of (3.1.2) through 
the point yo G R" at t = 0 is given by 

y{t) = e^Uo, (3.1.3) 

= id+ At + -b H 



where 



(3.1.4) 
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and “id” denotes the n x n identity matrix. We must assume sufficient 
background in the theory of linear constant coefficient ordinary differential 
equations so that (3.1.3) and (3.1.4) make sense to the reader. Excellent 
references for this theory are Arnold [1973] and Hirsch and Smale [1974]. 
Our goal here is to extract the necessary ingredients from this theory so as 
to give a geometrical interpretation to (3.1.3). 

Now M" can be represented as the direct sum of three subspaces denoted 
E“, and which are defined as follows: 

= spanjei, • • ■ ,eg}, 

= span{es+i, • • • , es+„}, s + u + c = n, (3.1.5) 

E — span{es-|-^t-|_i , * * * , , 

where {ei,---,es} are the (generalized) eigenvectors of A corresponding 
to the eigenvalues of A having negative real part, {e^+i, • • • , Cg+u} are the 
(generalized) eigenvectors of A corresponding to eigenvalues of A having 
positive real part, and {cs+u+i, ■ ■ ■ , e.g+u+c} are the (generalized) eigenvec- 
tors of A corresponding to the eigenvalues of A having zero real part (note: 
this is proved in great detail in Hirsch and Smale [1974]). A®, and 
A® are referred to as the stable, unstable, and center subspaces, respec- 
tively. They are also examples of invariant subspaces (or manifolds) since 
solutions of (3.1.2) with initial conditions entirely contained in either A®, 
E“, or must forever remain in that particular subspace for all time (we 
will motivate this a bit more shortly). Moreover, solutions starting in E® 
approach y = 0 asymptotically as t — >■ -l-oo and solutions starting in A" 
approach y = 0 asymptotically as t — >■ — oo. 

We will now describe the linear algebra behind the definition of the 
stable, unstable, and center subspaces in more detail, largely following the 
discussion in Hirsch and Smale [1974], to which we refer the reader for 
proofs of some of the statements. We denote the eigenvalues of the (real) 



matrix A by 










) ^s,i^ Ms,l ? 5 


7 f^S,j 1 } 


i + 2j = s, 


{A^^p , * 


) ^u,k-) ■ 


’ * * 5 f^U,l 5 } 


k + 21 = u. 


{Ac,i, • 


1 Mc,lj 


5 


m + 2n = c. 



where denote the real eigenvalues and denotes complex eigenval- 
ues. Since A is real, if p,a,j is an eigenvalue of A so is 

The generalized eigenspace corresponding to the (real) eigenvalue Xa^-y is 
defined as 

V {A, Aa^.^) = Ker {A — X^ ^ id)"“’^ , a = s,u,c, j = i,k, m, 

where Ua^-y is the (algebraic) multiplicity of the eigenvalue Ao,_.y (i.e., the 
number of times it appears as a root of the characteristic polynomial as- 
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sedated with A, det {A — Aid) = 0). The dimension of V (A, Xa,'y) is equal 

to Tio, ^'y . 

There is a bit of clumsiness with the notation here with respect to the 
eigenvalues with zero real part. The only way an eigenvalues can have zero 
real part, and be on the imaginary axis, is for it to be identically zero. 
Therefore Ac,i = 0, t = 1, . . . , m, and it follows that ric,i = m, z = 1, . . . , m. 

Complex eigenvalues are a bit more difficult to treat (actually, the dif- 
ficulty arises in the explanation; their treatment is fairly straightforward). 
Of course, the problem facing us is fairly evident. For a given complex 
eigenvalue iia,y we can define the corresponding generalized eigenspace in 
the usual way, i.e., 

V {A, fj,a,-y) = Ker {A — fj,a,y id)"“’^ , a = s,u,c, 7 = j, I, n, 

where Ua^y is the (algebraic) multiplicity of the eigenvalue Ha,y However, 
if one applies the same technique as one applies for real eigenvalues to 
compute the corresponding generalized eigenvectors one obtains complex 
generalized eigenvectors. Clearly, this is not a satisfactory situation since 
the phase space of our dynamical system is real. Nevertheless, one can 
derive a real basis for the generalized eigenspace corresponding to a complex 
eigenvalue from these “complex generalized eigenvectors” by adopting the 
complexification stratagem, which we now describe. First, we must give 
several definitions. 

Let us think of K" as a subset of C” in the natural way, i.e., K” is the 
set of complex n-tuples where each entry is a real number. 

Definition 3.1.1 Let F he a complex subspace o/C". Then Fr = F’nM" 
is the set of n-tuples in F that are real. Fr is referred to as the set of real 
vectors in F. 

It can easily be shown that Fr is closed under the operations of addition, 
as well as scalar multiplication by real numbers. Hence, Fr is a real vector 
space (a subspace of M"). 

Definition 3.1.2 (Complexification of a Subspace) Let E be a sub- 
space o/K”. Then 

Fc = G C” I z = ^ aiXi, Xi G F, Oi G c| , 

is called the complexification of F, where the sum denotes arbitrary, but 
finite, linear combinations. 

It is not hard to show that Fc is a complex subspace of C”, and that 
(-®c)r = E . 

Definition 3.1.3 (Complexification of a Real Linear Map) Suppose 
A : K" — >■ M" is a real linear map, and let E he a real subspace of M" that 
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is invariant under A. The complexification of A, denoted Ac, is a linear 
map of Ec into Ec, and is defined as follows. By the definition of Ec, any 
z € Ec can he represented as 

z = ^ aiXi, G E, ai G C, 

where the sum denotes arbitrary, hut finite, linear eomhinations. Then we 
define 

Acz = ^ aiAxi. 

Finally, we are at the point where we can define the generalized 
eigenspace corresponding to a complex eigenvalue p.a.-y This is given by 

V (A, /Tq, .y, /ta,7) = (1^ 0 ^ (^C; ^ 

a = s,u,c, ^ = j,l,n, 

where 

V (Ac,pta,'f') = Ker (Ac - . 

We then define the stable, unstable, and center subspaces as follows 
i j 



if® = ^ V (A, A,,y) + ^ c (A, Ai«, 7 , fisn ) , 

7=1 7=1 


(3.1.6) 


k 1 

if“ = V (A, A„^y) + V (A, Hu.j, , 

7=1 7=1 


(3.1.7) 


n 

if^ = U(A,0)+^U(A,Ale.7,/ic,7), 


(3.1.8) 



7=1 



where the sums are taken to be direct sums. By the primary decomposition 
theorem (Hirsch and Smale [1974]) we have 

K" = © ET 



3.1a Invariance of the Stable, Unstable, and 
Center Subspaces 

Let us outline how one ascertains the invariance of these subspaces under 
the linear flow given in (3.1.3) (this will provide some useful hints for the 
exercises). First, consider the matrix A associated with the linear vector 
field (3.1.2) as a linear map of K” into M". Clearly, if®, if“, and E'^ are 
invariant subspaces for this linear map since each is the subspace spanned 
by a particular collection of generalized eigenvectors. We want to argue that 
they are invariant under the linear map e^*. This relies on the following 
three facts that are proven in a basic linear algebra course. 

Suppose V C K" is a subspace and invariant under the linear map A. 
Then 
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• For any c G K, U is invariant with respect to cA. 

• For any integer n > 1, U is invariant with respect to A”. 

• Suppose Ai and A 2 are linear maps on K" and V is invariant with 
respect to both Ai and A 2 . Then V is invariant with respect to Ai + 
A 2 - From this result it also follows that for any finite number of 
linear maps Ai, i = 1, . . . ,n, with V invariant under each, V is also 
invariant under X)r=i 

Using each of these facts one can easily conclude that V is invariant 
under the linear map 

1 1 "1 

LJt) = id + At + -Ah^ + • • • + - = y -AH\ 

2 n! ^ ' i\ 

i=o 

for any n, where id is the n x n identity matrix (and 0! = 1). Now using 
the fact that V is closed, and that Ln{t) converges to uniformly, we 
conclude that V is invariant with respect to 



3.1b Some Examples 

We now illustrate these ideas with three examples where for simplicity and 
easier visualization we will work in 



Example 3.1.1. Suppose the three eigenvalues of A are real and distinct and 
denoted by Ai, A 2 < 0, As > 0. Then A has three linearly independent eigenvec- 
tors ei, 62 , and 63 corresponding to Ai, A 2 , and As, respectively. If we form the 
3x3 matrix T by taking as columns the eigenvectors ei, 62 , and 63 , which we 
write as 



then we have 



T = 




/ : 




A 


6l 


62 


63 


V ; 




; / 


0 


0 ^ 




A2 


0 


1 


0 


As J 





(3.1.9) 



(3.1.10) 



Recall that the solution of (3.1.2) through j/o G at t = 0 is given by 



y{t) = e^^yo 



TAT~^t 

e Vo- 



(3.1.11) 



Using (3.1.4), it is easy to see that (3.1.11) is the same as 
y{t) = Te^*T-'j/o 

0 0 \ 

0 6 ^ 2 * 0 T'^yo 

V 0 0 



= T 
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/ : 



eie 






626 



636 



T-^yo- 



(3.1.12) 



V : : ■ J 

Now we want to give a geometric interpretation to (3.1.12). Recall from (3.1.5) 
that we have 



E“ = spanjei, 62}, 
= spanjes}. 

Invariance 



Choose any point j/o £ Then T~^ is the transformation matrix which changes 
the coordinates of yo with respect to the standard basis on R® (i.e., (1,0,0), 
(0, 1, 0), (0, 0, 1)) into coordinates with respect to the basis ei, 62, and 63. Thus, 
for yo ^ E“, T~^yo has the form 

T~^yo = 

and, for yo G i?“, T~^yo has the form 

T~^yo = 

Therefore, by substituting (3.1.13) (resp., (3.1.14)) into (3.1.12), it is easy to see 
that yo G E" (resp., i?“) implies e^^yo G E" (resp., E“). Thus, E" and are 
invariant manifolds. 



yoi 

yo2 

0 



0 

0 

, yo3, 



(3.1.13) 



(3.1.14) 



Asymptotic Behavior 

Using (3.1.13) and (3.1.12), we can see that, for any yo G E“ , we have e^^yo 0 
as f — >■ +CX3 and, for any yo G i?“, we have e^^yo — >■ 0 as f — >■ —00 (hence the 
reason behind the names stable and unstable manifolds). 

See Figure 3.1.1 for an illustration of the geometry of E“ and 

End of Example 3.1.1 



Example 3.1.2. Suppose A has two complex conjugate eigenvalues p±iui, 
p < 0, w yf 0 and one real eigenvalue A > 0. Then A has three real generalized 
eigenvectors ei, 62, and 63, which can be used as the columns of a matrix T in 
order to transform A as follows 

/ p w 0\ 

A= -o) p 0 =T~^AT. (3.1.15) 

\ 0 0 A/ 

From Example 3.1.1 it is easy to see that in this example we have 
y{t) = Te^^T-^yo 

/ e^^coswt e'’^ s'mwt 0 \ 

= r( — e'^^sinoif e’’*cosu>t 0 \T~^yo. (3.1.16) 

\ 0 0 / 
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FIGURE 3.1.1. The geometry of E” and for Example 3.1.1. 

Using the same arguments given in Example 3.1.1 it should be clear that E“ = 
span{ei, 62 } is an invariant manifold of solutions that decay exponentially to zero 
as t — >■ + 00 , and = span{es} is an invariant manifold of solutions that decay 

exponentially to zero as t ^ —00 (see Figure 3.1.2). 



End of Example 3.1.2 



Example 3.1.3. Suppose A has two real repeated eigenvalues, A < 0, and a 
third distinct eigenvalue 7 > 0 such that there exist generalized eigenvectors ei, 
62 , and 63 which can be used to form the columns of a matrix T so that A is 
transformed as follows 

(\ 1 0 \ 

A= 0 A 0 =T~^AT. (3.1.17) 

\0 0 7/ 

Following Examples 3.1.1 and 3.1.2, in this example the solution through the 
point yo £ at t = 0 is given by 

y{t) = Te^‘r-ij /0 

/e^‘ 0 \ 

= T 0 0 ]T~^yo- (3.1.18) 

V 0 0 

Using the same arguments as in Example 3.1.1, it is easy to see that E“ = 
spanjei, 62 } is an invariant manifold of solutions that decay to y = 0 as t — >■ + 00 , 
and = spanjes} is an invariant manifold of solutions that decay to j/ = 0 as 
t —00 (see Figure 3.1.3). 



End of Example 3.1.3 
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FIGURE 3.1.2. The geometry of E“ and for Example 3.1.2(for u) < 0). 




FIGURE 3.1.3. The geometry of E" and E“ for Example 3.1.3 

The reader should review enough linear algebra so that he or she can 
justify each step in the arguments given in these examples. We remark 
that we have not considered an example of a linear vector field having a 
center subspace. The reader can construct his or her own examples from 
Example 3.1.2 by setting p = 0 or from Example 3.1.3 by setting A = 0; we 
leave these as exercises and now turn to the nonlinear system. 
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3.2 Stable, Unstable, and Center Manifolds for 

Fixed Points of Nonlinear, Autonomous Vector 
Fields 



Recall that our original motivation for studying the linear system 

y = Ay, j/eM”, (3.2.1) 

where A = Df{x), was to obtain information about the nature of solutions 
near the fixed point a; = a; of the nonlinear equation 

i = f(x), a;eK". (3.2.2) 

The stable, unstable, and center manifold theorem provides an answer to 
this question; let us first transform (3.2.2) to a more convenient form. 

We first transform the fixed point a; = a; of (3.2.2) to the origin via the 
translation y = x — x. In this case (3.2.2) becomes 

y=f{x + y), 1 /GK”. (3.2.3) 

Taylor expanding f{x + y) about x = x gives 

y = DJ{x)y + R{y), y G R”, (3.2.4) 

where R{y) = 0{\y\^) and we have used f{x) = 0. From elementary linear 
algebra (see Hirsch and Smale [1974]) we can find a linear transformation 
T which transforms the linear equation (3.2.1) into block diagonal form 

R, 0 0 \ /m\ 

0 Au 0 u , (3.2.5) 

0 0 aJ \wJ 

where T~^y = {u,v,w) G M® x R“ x R'^, s + u + c = n, is an s x s 
matrix having eigenvalues with negative real part, A^ is an m x u ma- 
trix having eigenvalues with positive real part, and Re is an c x c matrix 
having eigenvalues with zero real part (note: we point out the (hopefully) 
obvious fact that the “0” in (3.2.5) are not scalar zero’s but rather the 
appropriately sized block consisting of all zero’s. This notation will be used 
throughout the book). Using this same linear transformation to transform 
the coordinates of the nonlinear vector field (3.2.4) gives the equation 

u = AsU + Rs{u, V, w), 

V = AuV + Ru{u,v,w), (3.2.6) 

w = AcW + Rc{u, V, w), 




where Rs{u,v,w), Ru{u,v,w), and Rc{u,v,w) are the first s, u, and c 
components, respectively, of the vector T~^ R{Ry) . 
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Now consider the linear vector field (3.2.5). From our previous discussion 
(3.2.5) has an s-dimensional invariant stable manifold, a u-dimensional in- 
variant unstable manifold, and a c-dimensional invariant center manifold all 
intersecting in the origin. The following theorem shows how this structure 
changes when the nonlinear vector field (3.2.6) is considered. 

Theorem 3.2.1 (Local Stable, Unstable, and Center Manifolds of 
Fixed Points) Suppose (3.2.6) is O'", r > 2. Then the fixed point 
(u,v,w) = 0 of (3.2.6) possesses a C’’ s-dimensional local, invariant stable 
manifold, a C’’ u-dimensional local, invariant unstable manifold, 

and a C’’ c-dimensional local, invariant center manifold, 
all intersecting at (u,v,w) = 0. These manifolds are all tangent to the re- 
spective invariant subspaces of the linear vector field (3.2.5) at the origin 
and, hence, are locally representable as graphs. In particular, we have 

= {(w,t;,w) gK" xM“ xM“|u = /i^(u),w = /i^(m); 

Dhl{0) = 0,I?/i^(0) = 0; |m| sufficiently small} 



WOM = {(w, u, tc) G K* X X I w = hl{v),w = hHv); 
Uft,“(0) = 0,U/i“(0) = 0; |u| sufficiently small} 



= {{u,v,w) eW xR^ xR‘=\u = hl{w),v = hl{w)-, 

U/iO(0) = 0,Dh'){0) = 0; |t(;| sufficiently small} 

where h%{u), h’fiffi), h)^{v), hf„{v), hf{w), and h%{w) are C’’ functions. 
Moreover, trajectories in lFj®^(0) and Wj“j.(0) have the same asymptotic 
properties as trajectories in and U“, respectively. Namely, trajectories 
of (3.2.6) with initial conditions in IFjq^(O) (resp., approach the 

origin at an exponential rate asymptotically as t ^ -l-oo (resp., t — >■ —oo). 

Proof: See Fenichel [1971], Hirsch, Pugh, and Shub [1977], or Wiggins [1994] 
for details as well as for some history and further references on invariant 
manifolds. □ 



Some remarks on this important theorem are now in order. 

Remark 1. First some terminology. Very often one hears the terms “stable 
manifold,” “unstable manifold,” or “center manifold” used alone; however, 
alone they are not sufficient to describe the dynamical situation. Notice 
that Theorem 3.2.1 is entitled stable, unstable, and center manifolds of 
fixed points. The phrase “of fixed points” is the key: one must say the 
stable, unstable, or center manifold of something in order to make sense. 
The “somethings” studied thus far have been fixed points; however, more 
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general invariant sets also have stable, unstable, and center manifolds. See 
Wiggins [1994] for a discussion. 

Remark 2. The conditions Dh^{0) = 0, Dh%{0) = 0, etc., reflect that the 
nonlinear manifolds are tangent to the associated linear manifolds at the 
origin. 

Remark 3. In the statement of the theorem the term local, invariant sta- 
ble, unstable, or center manifold is used. This deserves further explanation. 
“Local” refers to the fact that the manifold is only defined in the neigh- 
borhood of the fixed point as a graph. Consequently, these manifolds have 
a boundary. They are therefore only locally invariant in the sense that tra- 
jectories that start on them may leave the local manifold, but only through 
crossing the boundary. Invariance is still manifested by the vector held 
being tangent to the manifolds, which we discuss further below. 

Remark 4- Suppose the fixed point is hyperbolic, i.e., = 0. In this case 

an interpretation of the theorem is that trajectories of the nonlinear vector 
held in a sufficiently small neighborhood of the origin behave the same as 
trajectories of the associated linear vector held. 

Remark 5. In general, the behavior of trajectories in in Wj°j.(0) cannot be 
inferred from the behavior of trajectories in 

Remark 6. Uniqueness of Stable, Unstable, and Center Manifolds. Typi- 
cally the existence of these invariant manifolds are proved through a con- 
traction mapping argument, where the invariant manifold turns out to be 
the unique fixed point of an appropriately constructed contraction map. 
From this construction the stable and unstable manifolds are unique. The 
center manifold is a bit more delicate. In that case, because of the nonhy- 
perbolicity, a “cut-off” function is typically used in the construction of the 
appropriate contraction map. In this case the center manifold does depend 
upon the cut-off function. However, it can be shown that the center mani- 
fold is unique to all orders of its Taylor expansion. That is, center manifolds 
only differ by exponentially small functions of the distance from the fixed 
point. See Wan [1977], Sijbrand [1985] and Wiggins [1994]. 

3.2a Invariance of the Graph of a Function: 

TANGENCY of THE VECTOR FIELD TO THE GRAPH 

Suppose one has a general surface, or manifold and one wants to check if it 
is invariant with respect to the dynamics generated by a vector held. How 
can this be done? 

Suppose the vector held is of the form 

i = f{x,y), 

y = g{x,y), (a;, y) G M" x R'". 

Suppose that the surface in the phase space is represented by the graph of 
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a function 

y = h{x), 

This surface is invariant if the vector field is tangent to the surface. This 
tangency condition is expressed as follows 

Dh{x)x = y, 



or, 

Dh{x)f{x, h{x)) = g{x, h{x)). (3.2.7) 

Of course, one must take care that all the functions taking part in these 
expressions have common domains, and that the appropriate derivatives 
exist. It is also very important to appreciate the role that specific coordinate 
representations played in deriving this expression. 



3.3 Maps 

An identical theory can be developed for maps. We summarize the details 
below. Consider a C’’ diffeomorphism 

X !->■ g{x), X G K". (3.3.1) 

Suppose (3.3.1) has a fixed point ai x = x and we want to know the nature 
of orbits near this fixed point. Then it is natural to consider the associated 
linear map 

y Tly, y G K”, (3.3.2) 

where A = Dg(x). The linear map (3.3.2) has invariant manifolds given by 

A® = spanjei,- • • ,6*}, 

= span{cs+i, • • • , Cs+u}, 

^ — span{es-|_^-i_i , * * * , Cs+u+c}? 

where s + u + c = n and ei , • • • , are the (generalized) eigenvectors of 
A corresponding to the eigenvalues of A having modulus less than one, 
Cs+i, • • • , €s+u are the (generalized) eigenvectors of A corresponding to the 
eigenvalues of A having modulus greater than one, and Cs+u+i, ■ ■ ■ , e«+u+c 
are the (generalized) eigenvectors of A corresponding to the eigenvalues of 
A having modulus equal to one. The reader should find it easy to prove this 
by putting A in Jordan canonical form and noting that the orbit of the 
linear map (3.3.2) through the point j/o G 1^" is given by 

{• • • , A~^yo, ■■■, A~^yo, yo, Ayo, ■■■, A^yo, ■ ■ •}■ (3.3.3) 

Now we address the question of how this structure goes over to the non- 
linear map (3.3.1). In the case of maps Theorem 3.2.1 holds identically. 
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FIGURE 3.3.1. Local invariant manifold structure in the unforced Duffing oscil- 
lator, 0 < 5 < \/8. 

Namely, the nonlinear map (3.3.1) has a C’’ invariant s-dimensional sta- 
ble manifold, a O’" invariant u-dimensional unstable manifold, and a C’’ 
invariant c-dimensional center manifold all intersecting in the fixed point. 
Moreover, these manifolds are all tangent to the respective invariant man- 
ifolds of the linear map (3.3.2) at the fixed point. 

Essentially, everything about stable, unstable, and center manifolds for 
fixed points of vector fields holds for fixed points of maps. We will give 
examples in the exercises. However, before completing our discussion of in- 
variant manifolds let us apply our results to the unforced Duffing oscillator. 



3.4 Some Examples 

Example 3.4.1 (Application to the Unforced Duffing Oscillator). In Section 1 
we have seen that the equation 

x = y, 

y = X — — Sy, 5 > 0, 

has a saddle-type fixed point of {x, y) = (0, 0), and sinks at (±1, 0) for 5 > 0. From 
Theorem 3.2.1 we now know that (±1,0) have two-dimensional stable manifolds 
(this is obvious) and (0, 0) has a one-dimensional stable manifold and a one- 
dimensional unstable manifold as shown in Figure 3.3.1 (note: we have drawn 
the figure for 0 < <5 < %/8. The reader should show how the solutions near the 
sinks are modffied for <5 > \/8). Note that Theorem 3.2.1 also tells us that a good 
local approximation to the stable and unstable manifolds of (0, 0) is given by the 
corresponding invariant linear manifolds, which are relatively easy to calculate. 
The case d = 0 is treated in great detail in Chapter 5. 



End of Example 3.4.1 
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Let us consider a final example from Guckenheimer and Holmes [1983]. 




FIGURE 3.4.1. The stable and unstable subspaces in Example 3.4.2. 



Example 3.4.2. Consider the planar vector field 

X = X, 

y = -y + x^, (a:, j/) € X 

which has a hyperbolic fixed point at (x,y) = (0,0). The associated linearized 
system is given by 

X = X, 

V = -y, 

with stable and unstable subspaces given by 

-B'" = { I a: = 0}, 

E“ = {(a:,j/)eR^|y = 0} 



(see Figure 3.4.1). 

Now we turn our attention to the nonlinear vector field for which, in this 
case, the solution can be obtained explicitly as follows. Eliminating time as the 
independent variable gives 

y = ^ = ZV + ^ 

• J “T ) 

X ax X 

which can be solved to obtain 

, \ x^ c 

J/(o:) = y + ^, 

where c is some constant. Now lFi“;.(0, 0) can be represented by a graph over 
the X variables, i.e., y = h(x) with h{Q) = h! {0) = 0. Varying c in the solution 
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FIGURE 3.4.2. Stable and unstable manifolds of ix,y) = (0,0) in Example 3.4.2. 



above takes us from orbit to orbit; we seek the value of c which corresponds to 
the unstable manifold — this is c = 0. Therefore, we have 



VEioc(0,0) 



{x,y) e 




which is also the global unstable manifold of the origin. Finally, note that if 
we have initial conditions with the x component equal to zero, i.e., (0,y) Vy, 
then the solution stays on the y axis and approaches (0,0) as t f oo; thus, 
E‘^ = IU'’(0, 0) = { (a;, y) I a; = 0 } (see Figure 3.4.2). 



End of Example 3.4.2 



3.5 Existence of Invariant Manifolds: The Main 
Methods of Proof, and How They Work 

In this section we will describe the two main techniques for proving the 
existence of invariant manifolds. In particular, we will concentrate on how 
one proves the existence of stable, unstable, and center manifolds of fixed 
points, although the same approach works in more general settings. There 
are many excellent expositions of the proofs of these theorems. Our ap- 
proach will be to show how they work in a concrete example where the 
answer is know explicitly. In this way we hope that the reader will be able 
to gain intuition concerning the key features of the problem that enables 
the procedure of the proof to work. 

Of course, one might ask the question, “why do I need to know how to 
prove the stable and unstable manifold theorem?” There are two answers to 
this question. One is that the techniques of proof are often used as the basis 
for numerical approaches to computing stable and unstable manifolds. Even 
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if one has no interest in the details of the proof of the theorem, in using it 
in applications one often needs to numerically compute the manifolds. An 
overview of such numerical methods can be found in Moore and Hubert 
[1999] and Osinga [1996]. The other answer is that in a specific application 
the precise setting or hypotheses of the theorem may not be satisfied (e.g., 
the equilibrium point may be a saddle, but not hyperbolic, or the invariant 
set may be more complicated than an equilibrium point). In this situation 
it may be possible to prove a new theorem by following the same general 
approach, but with modifying certain details. 

The main techniques for proving the existence of invariant manifolds are 
the following. 

Hadamard’s Method-The Graph Transform 

Hadamard [1901] developed this method to prove the existence of stable 
and unstable manifolds of a fixed point of a C’’ invertible map. The graph 
transform method is more geometrical in nature than the Liapunov-Perron 
method. In the context of a hyperbolic fixed point, the stable and unstable 
manifolds are constructed as graphs over the linearized stable and unstable 
subspaces, respectively-hence the name. Fenichel [1971], [1974], [1977] and 
Hirsch, Pugh, and Shub [1978] used this method in obtaining their general 
results on normally hyperbolic invariant manifolds. An elementary and de- 
tailed exposition of the graph transform method following Fenichel can be 
found in Wiggins [1994]. 

The Liapunov- Perron Method 

Perron [1928], [1929], [1930] and Liapunov [1947] developed a method 
for proving the existence of stable and unstable manifolds of a hyperbolic 
equilbrium point. It deals with the integral equation formulation of the 
ordinary differential equations and constructs the invariant manifolds as a 
fixed point of an operator that is derived from this integral equation on a 
function space whose elements have the appropriate interpretation as sta- 
ble and unstable manifolds. The Liapunov-Perron method has been used in 
many different situations. The book of Hale [1980] is a good reference point 
for surveying these applications. In this book Hale surveys the fundamen- 
tal earlier work of Krylov and Bogoliubov, Bogoliubov and Mitropolski, 
Diliberto, Kyner, Kurzweil, and Pliss. In Chapter 7 of Hale [1980] several 
theorems related to various aspects of invariant manifolds are given which 
we refer to collectively as the “Hale invariant manifold theorem”. These 
results can be viewed as a generalization and extension of much of the ear- 
lier work. Much of this work has recently been generalized in Yi [1993a,b]. 
Chicone [1999] has an excellent elementary and detailed exposition of the 
Liapunov-Perron method. 
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3.5a Application of These Two Methods to a 
Concrete Example: Existence of the 
Unstable Manifold 

We recall Example 3.4.2: 

X = X, 

y = -y + x^, (x,y)GK^. (3.5.1) 

This vector field has a hyperbolic fixed point at the origin with unstable 
manifold given by 

y=y. (3.5.2) 

Now we will prove the existence of the unstable manifold by the graph 
transform method and the Liapunov-Perron method. 

Application of the Graph Transform Method 

The trajectory of (3.5.1) through the point (xo,yo) at t = 0 is easily calcu- 
lated and found to be: 



x(t; xo,yo) = xoe\ 

y{t; xo, yo) = yoe~* + - l) . (3.5.3) 

We view the trajectories as mapping initial conditions at t = 0 to points 
at the time t; often this is referred to as the time-t map. 

Now consider a function y = h{x), and we consider an initial condition 
(xo,yo) on the graph of this function, i.e., yo = h(xo). The image of this 
initial condition under the time-t map is given by 

(xo, h{xo)) (x(t; Xo, h{xo)),y{t; xq, h{xo))) ■ (3.5.4) 

Now if the graph of y = h{x) was an invariant manifold then, for any t, we 
would have: 

y{t; Xo, h{xo)) = h{x{t; xo, h{xo))). (3.5.5) 

This motivates us to define the following graph transform, Q: 

Qh (x(t; Xo, ^(xo))) = y{t; xo, h{xo)), (3.5.6) 

where from (3.5.5) we see that the graph of a function y = h{x) is invariant 
under the time-t map if it is a fixed point of the graph transform. 

Now we are getting ahead of ourselves and we need to go back and 
develop the mathematical setting where the phrase “fixed point of the 
graph transform” makes sense. 
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We define the following space of functions: 

Si = {the set of functions, /i(a;), where \h{x) — h{x')\ < 5\x — x' \, 

\x\ < e, 5 > 0} . 

(3.5.7) 

In other words, these are the set of Lipshitz functions with Lipschitz con- 
stant 5 defined on the domain |a;| < e. These two parameters, 5 and e are 
adjusted in order to make the proof work. We can put a norm on Ss that 
is defined as follows: 

II h 11= sup \h{x)\. 

\x\<e 

With this norm Ss becomes a Banach space, but more importantly, it 
becomes a complete metric space in the metric that is defined by this norm. 
This latter fact is important for using the contraction mapping principle. 

The complete metric space Ss will be the domain on which the graph 
transform is defined. Ultimately, we want to show that the graph transform 
has a fixed point in Ss which, as we argued above, will be an invariant 
manifold. However first, several facts must be established. 

The Graph Transform is Well-Defined: 

We need to show that for any h £ Ss, Gh is a function. Using (3.5.3) and 
(3.5.6), we have 

Gh{x{t] xo, h(xo))) = h{xo)e~* + - l) . 

Clearly, this is a well-defined function. 
g ■■ Ss^ Ss: 

This is the first step in showing that t/ is a contraction map on Ss- 
First, we need a preliminary result. Define 

f = x{t; Xo, h{xo)) = xoe*, f,' = x(t; Xq, /i(xq)) = x'^e*. 

Then, for t > 0, we have 

k-ri > ko-a:'o|. (3.5.8) 

Next we calculate: 

\GHf) - Gh{f,')\ < e“‘|h(xo) - /i(x'o)| -k (e^‘ - l) |xq -k x'o||xo - Xq|, 

< -k - l) |xq -k Xq|^ |xq - Xq|, 

< (^Se~^ + - l) |xq -k Xq|^ |C - f'\, 

<(<5e-‘+?e-‘ e) |^ - ^1 



(3.5.9) 
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Now for fixed f > 0, we can choose S and e small enough such that 

(^e-* + - l) < 5. 

Q is a Contraction Map on Ss-' 

Choose a;o and x'q such that 

^ = x{t;xo,hi{xo)) = x{t;xQ,h2{xQ)). (3.5.10) 

Since x(t; xq, hi{xo)) = xoe*, x{t; Xq, h2{x'o)) = XqC* this implies that 

xq = x'q. (3.5.11) 



Now for ft-i, ft -2 G Ss we have 

\Qhi{i) - Qh2{S,)\ < e~^\hi{xo) - h2{xo)\ 

<e-‘ II /ii-h2 II, (3.5.12) 

which is a contraction for t > 0. Therefore by the contraction mapping 
theorem, the graph transform has a unique fixed point, h{x), |x| < e. By 
construction, the graph of h{x) is invariant under the time-t map, for some 
fixed t > Q. 

y = h{x) is Invariant: 

There are two important points that need to be addressed related to the 
issue of invariance. 

We have argued that y = h{x) is invariant under the discrete time-t map, 
for some fixed t > 0. ^ There is a technicality involved with this statement. 
One sees from the form of the trajectories of the vector field given in (3.5.3) 
that the x component of any trajectory grows exponentially in time. That 
means that any point starting on y = h{x) will eventually leave y = h{x) 
(for t > 0), but only by crossing the boundary, which is given by the two 
points J/+ = h{+e) and y_ = h{—e). The graph of h{x) is an example 
of a locally invariant manifold. That is, points starting on the graph of 
h{x) remain on the graph during time evolution, except they may leave 



^Notice that the subscript “0” has disappeared from xo in our notation for 
the domain of the graph of the function which gives the local unstable manifold. 
This seemingly trivial point is one worth considering in a bit more detail. The 
subscript “0” makes explicit the fact that the invariant manifolds are manifolds of 
initial conditions for trajectories. This understanding is important and is similar 
to the understanding of how trajectories are different from orbits. Once this is 
made clear, the subscript “0” can be dropped since initial conditions are just 
points in the phase space. However, in setting up the problem, one needs to have 
the subscript so as not to confuse the notation for the initial condition with that 
for the trajectory itself. 
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the graph but only by crossing the boundary. This situation is typical for 
local stable, unstable, and center manifolds associated with fixed points 
since these manifolds are constructed in small neighborhoods of the fixed 
point as graphs over the respective linearized stable, unstable, and center 
subspaces. Therefore the domains of the graphs are finite, which generally 
gives rise to a boundary for these local invariant manifolds. Shortly we 
will show how one turns these local manifolds into global manifolds (and 
describe what that means). 

The second important point concerning invariance is that we need to 
establish that points that start on y = h{x) stay on it for any time, unless 
they leave it by crossing the boundary. Our graph transform proof has only 
shown that the graph of h{x) is invariant with respect to the time-t map for 
some fixed t > 0. We leave the details of this as an exercise for the reader, 
but see Hartman [1964], Fenichel [1971], or Wiggins [1994] for details. 

Continuation: The Global Unstable Manifold: 

Once the existence of the local unstable manifold is established, we can 
then construct the global unstable manifold by letting the local unstable 
manifold evolve in time under the time-t map. More precisely, the global 
unstable manifold of the origin is given by: 

IT“ = [x{t;x,h{x)),y(t;x,h{x))) . (3.5.13) 

t>Q 

\x\<e 

Clearly, this is object is invariant since it is a union of trajectories. It is also 
one dimensional, and trajectories with initial conditions in approach 
the origin at an exponential rate as t — >■ — oo. 

Differentiability of the Unstable Manifold: 

The local unstable manifold found by the graph transform is the graph of 
a Lipschitz function. However, the stable, unstable, and center manifold 
theorem for a fixed point states that if the vector field (or map) is C’’, then 
the unstable manifold should also be C’’ (and our example is C°°). 

The procedure for showing that y = h{x) is is as follows. The fixed 
point of the graph transform satisfies (3.5.5). We formally differentiate 
(3.5.5) with respect to xq to obtain: 

Dh [Dix + D 2 X Dh) = Diy + I?2j/ Dh, 

or 

Dh= {Diy + D2yDh) {Dix + D 2 X Dh)~^ . (3.5.14) 

Here we are leaving out the arguments of all the functions in (3.5.5). The 
notation D denotes the derivative with respect to xg, D\ denotes the partial 
derivative with respect to Xg and D 2 denotes the partial derivative with 
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respect to yo- The phrase “formally differentiate’ used above means to do 
exactly what we did to (3.5.5), which resulted in (3.5.14). We do not know 
that (3.5.14) has any meaning since it has not yet been shown that h{x) is 
differentiable. 

Therefore we define the operator 

Hv = {Diy + D 2 yv) {Dix + D 2 xv)~^ , (3.5.15) 

where w(-) is the unknown function. Clearly, a fixed point of this operator is 
a solution of (3.5.14). We use contraction mapping techniques to show that 
TL does have a fixed point. Then we show that this fixed point is actually 
the derivative of the fixed point of the graph transform. This will show that 
the fixed point of the graph transform is . One then proceeds inductively 
to obtain higher order derivatives. The details can be found in Hartman 
[1964], Fenichel [1971], or Wiggins [1994]. 

This approach yields that the local unstable manifold is differentiable. 
What about the global unstable manifold defined in (3.5.13)? This will 
follow from Theorem 7.1.1 in Chapter 7 which says that the trajectories of 
a C"" vector field are C’’ functions of the initial conditions. Since (3.5.13) is 
constructed by mapping the initial conditions of the local unstable manifold 
by the trajectories it will then follow that the global unstable manifold is 
C^. 

Application of the Liapunov-Perron Method 

Now we develop the Liapunov-Perron method for proving the existence of 
the unstable manifold of the origin for the same example. The Liapunov- 
Perron method uses the integral equation form of the ordinary differential 
equation (3.5.1), since (3.5.1) can be equivalently written in the form: 

x{t; xo,yo) = xoe\ 

y{t-,xo,yo) = yoe~* + e~^ e'^ x‘^{t', xo,yo)dT. (3.5.16) 

Jo 

Suppose the graph of a Lipschitz function y = h{x), with h{0) = 0 and 
Lipschitz constant 6, is an invariant set for (3.5.1). Then y{t; xq, h{xo)) = 
h{x{t] xo, h(xo))) is a solution of 

y = -y + X^ (3.5.17) 

with y(0; Xo, h(xo)) = h(xo)- Using (3.5.16), (3.5.17) can be rewritten as 

y(t; xq, h(xo)) = h(xo)e~^ -h e~^ / e'^x^(r; xq, h(xo))dr, (3.5.18) 

Jo 

or ^ 

h{xo) = e^y{t;xo,h{xo)) - / e''x‘^{r;xo,h{xo))dT. (3.5.19) 

^0 
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Now 

\y{t] xo, h{xo))\ = \h{x{t] Xq, /i(a;o)))l < S\x{t; Xq, h{xo))\ = 5e*|xo|, 
from which it follows that 



lim e*y{t] x, h(x)) = 0. 

t—¥ — 00 



Therefore, taking the limit as t ^ —oo in (3.5.19) gives: 
h{xo) = / e'^x'^{T]Xo,h{xo))dT, 

J — OO 



(3.5.20) 



' — OO 

rO 



e'^Xge^'^dT = — , 

0 3 



which is exactly the unstable manifold of the origin. Conversely, one can 
show that if h{xo) satisfies (3.5.20), then the graph oiy = h{x) is invariant. 
This motivates the following definition of the Liapunov- Perron operator. 

.0 

'Ph(xo) = / e'^x^(r; Xq, ft.(xo))dr, (3.5.21) 

J — OO 

which is defined on the complete metric space 

Sg = {the set of functions, /i(x), where \h{x) — h{x')\ < i5|x — x'|, 

/i(0) = 0, |x| < e, i5 > 0} , 

(3.5.22) 

with metric derived from the following norm 

II h 11= sup \h{x)\. 

\x\<e 



By our construction, a fixed point of the Liapunov-Perron operator is the 
unstable manifold of the origin. We will show that the Liapunov- Perron 
operator is a contraction map on Sg. As for the graph transform, several 
steps must be carried out in the course of establishing this fact. Of course, 
all of this is unneccessary for this example since we have already found 
the fixed point for the Liapunov- Perron operator, which is exactly the 
global unstable manifold of the origin. However, our goal here is to give 
a description of the method in general. With that in mind, we proceed 
accordingly. 

V is Well-Defined: 

This is obvious for this example from the form of (3.5.21). 
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Using (3.5.21) and the first equation of (3.5.16), we have 

\Vh{xQ) - Vh{xQ)\ < + a^ol \xq - Xq] < y |xq - Xq|, (3.5.23) 

so we need only choose e small enough such that y < <5. 

P is a Contraction Map on 5°: 

Using (3.5.21) and the first equation of (3.5.16), we have 



\Vhi{xo)-Vh2{xo)\ = Q, (3.5.24) 

So V is clearly a contraction map. By the contraction mapping theorem 
it therefore has a unique fixed point whose graph (by construction) is the 
invariant, local unstable manifold of the origin. 

Once the existence of the local unstable manifold is established then it 
can be continued to a global unstable manifold in the same way as was 
done for the local unstable manifold obtained by the graph transform. It 
can also be shown that the unstable manifold is differentiable in a way 
that was similar to that described for the graph transform. That is, one 
formally differentiates the Lipaunov-Perron operator and derives equations 
that the derivatives must satisfy. Fixed point methods are then use to show 
that these equations have solutions, and that the solutions are indeed the 
derivatives. See Chicone [1999] for details. 

Computing Invariant Manifolds using Taylor Expansions 

Once we know the existence of a C’’ invariant manifold we can use Tay- 
lor series methods to compute it. We will illustrate this in the context of 
example (3.5.1). Suppose the graph of y = h{x) is an invariant manifold. 
Then 

y = Dh{x)x, (3.5.25) 

or 

Dh{x)x-y = 0. (3.5.26) 

Since /i(0) = 0 (the invariant manifold passes through the fixed points at 
the origin) and Dh{0) = 0 (the invariant manifold is tangent to the unstable 
subspace of the origin, i.e., the a;-axis), we can assume that h{x) has the 
following form (provided the vector field is at least C^, but our example is 
C“): 

y = h{x) = ax^ + bx^ + -!-•••. (3.5.27) 

Substituting the expression for x and y from (3.5.1) into (3.5.26) gives: 

Dh{x)x — (—h{x) + x"^) = 0. (3.5.28) 
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Substituting (3.5.27) into (3.5.28) gives 

(2ax^ + 36x^ + + 0(5)) + ax^ + hx^ + cx'^ + 0(5) — = 0 (3.5.29) 

In order to solve this equation the coefficient multiplying each power of x 
must be zero. This implies: 

x"^ : 2a + a— l = 0=l>a=i 
36 + 6= 0 ^6=0 

x^:4c + c = 0 =l>c=0 

In fact, one easily sees that the coefficient on the nth order term, for any 
n > 2, must be zero. Hence we have 



y = h{x) = y. 

Simo [1990] and Beyn and Kless [1998] show how the Taylor method can 
be numerically implemented for computing invariant manifolds. 



3.6 Time-Dependent Hyperbolic Trajectories and 
their Stable and Unstable Manifolds 

In the previous sections we described the notion of hyperbolicity of a fixed 
point of an autonomous vector field, as well as the existence of stable, un- 
stable, and center manifolds associated with a fixed point (a parallel theory 
for maps also exists, but at the moment we are considering the continuous 
time case of vector fields) . All of these ideas and results were a result of the 
behavior of the vector field linearized about the fixed point. More precisely, 
they followed from the eigenvalue and generalized eigenspace structure of 
the constant matrix associated with the linearized vector field. Now we 
want to consider a general time dependent trajectory. What does it mean 
for such a trajectory to be hyperbolic? Does a hyperbolic trajectory have 
stable and unstable manifolds? These are questions that we now address. 

We consider vector fields of the form 

x = f{x,t), a; G [/ C K", 6 G K, (3.6.1) 

where U is some open set in M". 

The basic existence and uniqueness results for particle trajectories to or- 
dinary differential equations do not require very stringent requirements on 
the time dependence (these will be discussed in detail in Chapter 7) ; con- 
tinuity in t is sufficient (see Hale [1980]). However, the dependence on x is 
more stringent. We describe this more precisely in the following assumption 
that will stand throughout this chapter, unless otherwise stated. 
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Regularity Assumptions 

We assume that f{x, t) is continuous in t, for all t € M, and in x, r > 1. 
Moreover, all of the partial derivatives of v{x,f), with respect to x, up to 
order r, are uniformly continuous and uniformly bounded in Kx'R, where K 
is an arbitrary, compact subset ofU. These “regularity assumptions” on the 
velocity field will be sufficient for the invariant manifold theorems described 
below (Coddington and Levinson [1955], Hale [1980], or Yi [1993a, b]). 

3.6a Hyperbolic Trajectories 

We have seen that if the matrix associated with the linearization about a 
trajectory is time dependent, then the eigenvalues of this matrix need not 
give information about local stability of the trajectory. The appropriate 
notion for characterizing the local (linearized) behavior in this case is that of 
an exponential dichotomy (although Liapunov exponents are also relevant, 
which we will mention later), which we now define, and for which the 
standard reference is Coppel [1978]. 

Definition 3.6.1 (Exponential Dichotomy) Consider the following lin- 
ear ordinary differential equation with time dependent coefficients 

f = A{t)f, C G M", (3.6.2) 

where A{f) is a continuous function o/ t G K. Suppose X{t) is the fun- 

damental solution matrix of (3.6.2), i.e., for any initial condition 
f{t) = X{t)f,o is the solution of (3.6.2) passing through at t = 0, 
A(0) = id. Let || • || denote a norm on M". Then (3.6.2) is said to possess 
an exponential dichotomy if there exists a projection operator P, = P, 
and constants K\, K 2 , Ai, A 2 > 0, such that 

II A(f)PA“^(r) II < Ai exp (— Ai(t — r)) , t > r, (3.6.3) 

II A(t) (id- P) A“^(r) II < A 2 exp (A 2 (t - r)) , t<r. (3.6.4) 



We now give a simple example of a linear autonomous vector field that 
illustrates this notion of exponential dichotomy. In this case we know that 
the eigenvalues of the matrix used to define the linear vector field give us 
information about the local stability. Therefore one can relate this notion 
with the new idea of an exponential dichotomy. 

Example 3.6.1. Consider the following two dimensional, steady linear velocity 
field: 



X = —Ax, 

y = Ay, (x,t/)GR^, 



A > 0. 



(3.6.5) 
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The fundamental solution matrix is given by 



X{t) = 




(3.6.6) 



and we take 




(Clearly, — P.) Then we have 



and 



X(t)PX-(r)=(^ 0 2)’ 

X(t)(id-P)X-i(r)= • 



We can take as a norm on the set of real 2x2 matrices the absolute value of the 
largest matrix element. In that case we see that the bounds given in (3.6.1) are 
obeyed for any Ki = K 2 > 1. 



End of Example 3.6.1 

We now consider a less trivial time-dependent example. 

Example 3.6.2. 

Consider the following linear vector field with time-periodic coefficients that 
was earlier considered in Example 1.2.1: 



where 



A(t) = 



(r 


) = m ( 


f Xl 






-1-b 


1 cos^ t 


1 - 


1 _ 3 
^ 2 


cos t sin t 


-1 



(3.6.7) 



This is an example that we have seen earlier. The eigenvalues of A{t) are found 
to be independent of t and are given by 



Ai(t) = 






\2(t) = 






In particular, they have negative real parts for all t. However, one can verify that 
the following are two linearly independent solutions of this equation 



/ — cost 
vi{t) = ■ ^ 

' smt 



- . , sint X _t 

V 2 {t) = , e 

' cost 



Therefore the fundamental solution matrix is given by: 



, I —e^ cost e Sint 

X(t) — \ t , 

e2 Sint e cost 



(3.6.8) 



(3.6.9) 
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with inverse 



X-\t) = 



—e 2 COST e 2 sinr 
e’’ sin r cos r 



We take as the projection operator: 

P = 

Then we have 



00 ' 

0 1 y 



1 



e*-’’ sin r sin t cos r sin t ' 

sinr cost cost cost I ’ 



from which it follows that 

II X{t)PX-\T) ||< 



(3.6.10) 



(3.6.11) 



(3.6.12) 



(3.6.13) 



where for a norm on the space of 2 x 2 matrices we have taken the maximum of 
the absolute value of the matrix elements. 

Similarly, we have 



X{t) {id- P)X~^{t) 



( g2( '^■’■‘yos r COS t 

6 2 (“’■+*) COST sin t 



62 

i 

e2 



^ ’’^‘yinrcost 

(-T-i-t) gin.gsjjji 



(3.6.14) 



from which it follows that 

II X(t) (id- P)X'^(r) ||< t<T. (3.6.15) 

Taking K\ — K 2 — 1 and Ai = A 2 = | we see from (3.6.1) that this equation has 
an exponential dichotomy. 

We remark that this linear vector field is also interesting from the point of 
view that even though its coefficients are time periodic, it has no time dependent 
periodic solutions. 



End of Example 3.6.2 

With the notion of exponential dichotomy in hand we can now define the 
notion of a hyperbolic trajectory of a time dependent vector field. 

Definition 3.6.2 (Hyperbolic Trajectory) Let 7 (f) denote a trajectory 
of the vector field 



x = f{x,t), X G [/ C M”, t G K, (3.6.16) 

where U C K" is an open set, f is C’’, r > 1, in x, and continuous in t. 
Then 7(f) is said to be a hyperbolic trajectory if the associated linearized 
system 

C = Dxfh(t),t)^, 
has an exponential dichotomy. 



(3.6.17) 




56 



3. Invariant Manifolds: Linear and Nonlinear Systems 



We give a geometrical explanation of the notion of exponential di- 
chotomy. For this, the picture is more easily understood in the extended 
phase space: 

£: = {(x,t) € K” X Mix € [/}, (3.6.18) 

i.e., we append the dependent variable t to the phase space. We consider the 
velocity field defined on extended phase space by appending the (trivial) 
evolution of t to (3.6.16) as follows 

i = 

i=l, xGC/cM", teM, (3.6.19) 

and the hyperbolic trajectory in the extended phase space S is denoted by 

r(t) = ( 7 (t),t). (3.6.20) 

We define a time slice of the extended phase space £ as follows: 

T.r = {{x,t)e£\t = T}. (3.6.21) 

Then in the extended phase space the hyperbolic trajectory T(t) intersects 
Yir in the unique point 7 (t). 

In Definition 3.6.1 suppose the projection operator P has rank k. Then 

(3.6.3) implies that on the phase slice S,-, there is a fc dimensional sub- 
space of M”, corresponding to trajectories of the linearized equa- 

tions (3.6.17) that decay to zero at an exponential rate as t — >■ oo. Similarly, 

(3.6.4) implies that on the phase slice S,-, there is an n — k dimensional sub- 
space of M”, corresponding to trajectories of the linearized equations 

(3.6.17) that decay to zero at an exponential rate as t ^ —oo. Moreover, 
the angle between E‘^{t) and E'^{t) is bounded away from zero for all r. 
We illustrate this geometrically in Fig. 3.6.1. 

Next we examine how this situation carries over for the nonlinear equa- 
tions. 



3.6b Stable and Unstable Manifolds of 
Hyperbolic Trajectories 

We can now state the result that says that tangent to these linearized 
eigenspaces we have invariant manifolds for the full nonlinear vector field. 
Let Dp{r) £ Sr denote the ball of radius p centered at 7 (t). Then 

Akp(r(t)) = UreR(Ep(T),r) , 

is a tubular neighborhood of T(t) in £. We have the following theorem. 

Theorem 3.6.3 (Local Stable and Unstable Manifolds) For the set- 
up and hypotheses described above, there exists k 1 dimensional C’’ 
manifold Wf^jVft)) C £, and an n — k \ dimensional manifold 
W!fjT{f)) C £ and po sufficiently small such that for p € (0,po) 
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FIGURE 3.6.1. Geometry of the stable and unstable subspaces of the linearized 
system associated with the hyperbolic trajectory, both in the extended phase 
space and on a time slice, n = 2, k — 1. (The arrows are meant to indicate that 
the structure extends for all time.) 

1. Wig^{T{t)), the local stable manifold ofT{t) , is invariant under the 
forward time evolution generated by (3.6.19), W)fJ(r{t)), the local 
unstable manifold ofTft) , is invariant under the backward time evo- 
lution generated by (3.6.19). 

2. W((^(T{t)) and WK^(T{t)) intersect alongVff), and the angle between 
the manifolds is bounded away from zero uniformly for all t G K. 

3. Every trajectory on W((^(T(t)) can be continued to the boundary of 
Afp(T(t)) in backward time, and every trajectory on WJfJJdft)) can 
be continued to the boundary ofAfp(T(t)) in forward time. 

4 . Trajectories starting on at time t = t approach r(f) at 

an exponential rate e~^ (‘“'^) as t ^ 00 and trajectories starting 
on W)fjT{f)) at time t = t approach T{t) at an exponential rate 
g-A \t-T\ Pig f ^ Jqp gQfng constant A' > 0. 

5. Any trajectory in Np (r(f)) not on either ^Zoc(^(^)) 

will leave Afp (r(t)) in both forward and backward time. 



Proof: In some sense this theorem has been known for some time, although 
the autonomous version is much more widely known. The theorem can be 
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obtained from simple modifications of results found in Coddington and 
Levinson [1955] and Hale [1980]. The theorem in this form can be found 
in the Ph. D. thesis of Kaper [1992], see also Yi [1993a, b]. A discrete time 
version can be found in Katok and Hasselblatt [1995]. For a different ap- 
proach see Irwin [1973] and de Blasi and Schinas [1973]. □ 

The global stable and unstable manifolds, IT®(r(t)) and LF“(r(t)), are 
obtained in the usual way by evolving trajectories on and 

backward and forward in time, respectively. We illustrate this 
situation geometrically in Fig. 3.6.2. 



t 




t 



FIGURE 3.6.2. Geometry of the stable and unstable manifolds of the nonlinear 
system associated with the hyperbolic trajectory, both in the extended phase 
space and on a time slice. (The arrows are meant to indicate that the structure 
extends for all time. 



Persistence of Hyperbolic Trajectories, and their Stable and 
Unstable Manifolds, Under Perturbation 

We now want to consider what happens to this structure under perturba- 
tion. Consider the vector field 

X = f(x,t; e), 

i=l, xGUcW^, teK, (3.6.22) 

where e £ C (with denoting the ball of radius cq centered at 
0), and 



f{x,t; 0) = f{x,t). 
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We assume that (3.6.22) depends upon the vector of parameters e in a C’’ 
manner, with all of the partial derivatives of f{x, t, e), with respect to e, up 
to order r, being uniformly continuous and uniformly bounded in iC x K, 
where K is an arbitrary, compact subset of . 

Theorem 3.6.4 (Persistence Under Perturbation) Suppose that at 
e = 0 Theorem 3.6.3 holds for (3.6.22). Then there exists eg sufficiently 
small such that for all e G (3.6.22) possesses a hyperbolic trajectory 
rg(t), having a k+1 dimensional stable manifold, IU'*(rg(t)) and an n—k+1 
dimensional unstable manifold, IU“(rg(t)). These manifolds have the same 
properties as described in Theorem 3.6.3. Moreover, they depend on e in a 
C’’ manner. 

Proof: See Kaper [1992] or Yi [1993a, bj. □ 

An obvious question is how does one locate hyperbolic trajectories in 
vector fields having a general time dependence? Theorem 3.6.4 provides us 
with an easy way to do this if our problem is in the form of a time dependent 
perturbation of an autonomous vector field, i.e., consider a vector field of 
the form 



i = f{x) + eg{x,t] e), 

(=1, xGC/cK", teK. (3.6.23) 

Suppose that for e = 0 the vector field x = v(x) has a hyperbolic fixed point. 
Then, by Theorem 3.6.4, for e small this hyperbolic fixed point becomes 
a (generally) time varying hyperbolic trajectory with stable and unstable 
manifolds. 



3.7 Invariant Manifolds in a Broader Context 

In this chapter the invariant manifolds of interest have been the stable, 
unstable, and center manifolds of fixed points. Invariant manifold theory is 
a much broader and deeper subject and in this section we wish to describe 
some of the issues associated with invariant manifolds in a broader context. 

Existence of Invariant Manifolds 

Invariant manifold theory begins by assuming that some “basic” invari- 
ant manifold exists-the theory is then developed from this point onwards 
by building upon this basic invariant manifold. In particular, one is inter- 
ested in the construction of stable, unstable, and center manifolds asso- 
ciated with these basic invariant manifolds. The types of basic invariant 
manifolds typically considered are: 



1. Equilibrium points. 
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2. Periodic orbits, 

3. Quasiperiodic or almost periodic orbits (invariant tori). 

These manifolds all share an important property. Namely, they all admit a 
global coordinate description. In this case the dynamical system is typically 
subjected to a “preparatory” coordinate transformation that serves to lo- 
calize the dynamical system about the invariant manifold. This amounts to 
deriving a normal form in the neighborhood of an invariant manifold and 
it greatly facilitates the various estimates that are required in the analy- 
sis. Sacker [1969], Fenichel [1971], and Hirsch, Pugh, and Shub [1977] were 
among the first to consider general invariant manifolds that are not de- 
scribed as graphs and required an atlas of local coordinate charts for their 
description. This is an important generalization since in recent years in- 
variant manifolds that cannot be expressed globally as a graph have arisen 
in applications, see Wiggins [1990], Hoveijn [1992], and Haller and Wiggins 
[1996] for applications where invariant spheres arise. 

The Persistence and Differentiability of Invariant Manifolds 
Under Perturbation 

The question of whether or not an invariant manifold persists under per- 
turbation and, if so, if it maintains, loses, or gains differentiabilty is also 
important. In considering these issues it is important to characterize the 
stability of the unperturbed invariant manifold. This is where the notion 
of normal hyperholicity arises. Roughly speaking, a manifold is normally 
hyperbolic if, under the dynamics linearized about the invariant manifold, 
the growth rate of vectors transverse to the manifold dominates the growth 
rate of vectors tangent to the manifold. For equilibrium points, these growth 
rates can be characterized in terms of eigenvalues associated with the lin- 
earization at the equilibria that are not on the imaginary axis, for periodic 
orbits these growth rates can be characterized in terms of the Floquet mul- 
tipliers associated with the linearization about the periodic orbit that are 
not on the unit circle, for invariant tori or more general invariant manifolds 
these growth rates can be characterized in terms of exponential dichotomies 
(see Coppel [1978] or Sacker and Sell [1974]) or by the notion of generalized 
Lyapunov type numbers (Fenichel [1971]), which is the approach that we 
will take in this book. A question of obvious importance for applications 
is how does one compute whether or not an invariant manifold is normally 
hyperbolic? The answer is not satisfactory. For equilibria, the problem in- 
volves finding the eigenvalues of a square matrix-an algebraic problem. For 
invariant manifolds on which the dynamics is nontrivial the issues are more 
complicated, and they are dealt with in this book. However, one important 
class of dynamical systems which may have nontrivial invariant manifolds 
on which the dynamics is also nontrivial are integrable Hamiltonian sys- 
tems, see Wiggins [1988] for examples. Finally, we want to alert the reader 
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to an important characteristic of normal hyperbolicity that is of some im- 
portance for understanding the scope of possible applications. Namely, it is 
insensitive to the form of the dynamics on the invariant manifold, provided 
the dynamics transverse to the invariant manifold is dominant in the sense 
of normal hyperbolicity. Heuristically, one could think of the dynamics on 
the invariant manifold as being “slow” as compared to the “fast” dynamics 
off the invariant manifold. Hence, the dynamics on the invariant manifold 
could even be chaotic. 

Characterizing growth rates in the fashion described above requires 
knowledge of the linearized dynamics near orbits on the invariant man- 
ifold as t ^ -koo or t — >■ — oo. Hence, if the invariant manifold has a 
boundary (which an equilibrium point, periodic orbit, or invariant torus 
does not have), then one must understand the nature of the dynamics at 
the boundary. Notions such as overflowing invariance or inflowing invari- 
ance are developed to handle this. Invariant manifolds with boundary arise 
very often in applications, see Wiggins [1988]. 

Behavior Near an Invariant Manifold Stable, Unstable, and 
Center Manifolds 

A “stable manifold theorem” asserts that the set of points that approach 
an invariant manifold at an exponential rate as t ^ -koo is an invariant 
manifold in its own right. The exponential rate of approach is inherited from 
the linearized dynamics as the stable manifold is constructed as a graph 
over the linearized stable subspace or subbundle. An “unstable manifold 
theorem” asserts similar behavior in the limit as t ^ —oo. Obviously, one 
may have problems with both of these concepts if the invariant manifold 
has a boundary. 

The notion of a center manifold is more subtle. For equilibrium points and 
periodic orbits a center manifold is an invariant manifold that is tangent 
to the linearized subspace corresponding to eigenvalues on the imaginary 
axis, and floquet multipliers on the unit circle, respectively. In contrast to 
the situation with stable and unstable manifolds, the asymptotic behavior 
of orbits in the nonlinear center manifold may be very different than the 
asymptotic behavior of orbits in the linearized center subspaces, under the 
linearized dynamics. 

Questions related to persistence and differentiability of stable, unstable, 
and center manifolds also arise. 

More Refined Behavior Near Invariant Manifolds Foliations of 
Stable, Unstable, and Center Manifolds 

One may be interested in which orbits in the stable manifold approach 
the invariant manifold at a specified rate. Under certain conditions these 
orbits may lie on submanifolds of the stable manifold which are not in- 
variant, but make up an invariant family of submanifolds that foliate the 
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stable manifold. A similar situation may hold for the unstable manifold. 
Moreover, this foliation has the property that points in a fiber of the fo- 
liation asymptotically approach the trajectory in the invariant manifold 
that passes through the point of intersection of the fiber with the invariant 
manifold (the hasepoint of the fiber). This is a generalization of the notion 
of asymptotic phase, that is familiar from studies of stability of periodic or- 
bits, to arbitrary invariant manifolds. In recent years these foliations have 
seen many uses in applications. Fenichel [1979] has used them in his de- 
velopment of geometric singular perturbation theory. Kovacic and Wiggins 
[1992], Haller and Wiggins [1993], [1995], and McLaughlin et al. [1996] have 
used them in the development of new global perturbation methods. Finally, 
the recent monograph of Kirchgraber and Palmer [1990] proves a number 
of foliation results and shows how these can be used as coordinates in which 
the dynamical system becomes linear. 

Invariant Manifolds for Stochastic Dynamical Systems 

Invariant manifold theory for stochastic, or random dynamical systems 
is currently being developed. See Arnold [1998], Boxler [1989], [1991] and 
Schmalfuss [1998]. 



3.8 Exercises 

1. Prove that the boundary, closure, interior, and complement of an invariant set are 
invariant. 

2. Prove that the closure and interior of a positively invariant set are positively invariant. 

3. Prove that the complement of a positively invariant set is negatively invariant. 

4. Prove that the union and intersection of a family of invariant (resp. positively invariant) 
sets are invariant (resp. positively invariant). 

5. Let X 1 -^ /( 31 ), X G be a diffeomorphism and suppose fc = 0 is a fixed point, i.e., 
f (0) — 0. We define the stable set oi x — 0 as: 

S = {y G K”” I f^{y) 0, as n —¥■ oo| . 

Is tS a manifold? (Be careful here, see McGehee [1973].) 

6. Consider the linear vector field x = Ax, A a constant matrix. 

(a) Verify that x{t) — c^^xq is a trajectory of this vector field passing through the 
point xq at t — 0. 

(b) Prove uniqueness of solutions, i.e., verify that this is the only solution passing 
through Xq at t — 0. 

(c) Describe the dependence of trajectories on initial conditions and time (e.g., 
continuous, C^, analytic, etc.). 

(d) Suppose A depends on parameters., i.e., A — A{fi). Describe the dependence of 
the solutions on parameters in terms of the parameter dependence of A (e.g., 
continuous, C^, analytic, etc.). 

(e) Show that x{t + r) = xq is also a solution, for any r G K. Does this 

violate uniqueness of solutions? 
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(f) Prove that two different trajectories cannot intersect. 

7. Prove that for any fixed f, the map 

: R” -> R", 

X 1-^ 4>t{x) = e^*x^ 

defines a diffeomorphism of into 

8. Prove that (pt{^) = satisfies the following properties: 

(a) 4>o{x) — X, 

(b) (p_t o <pt{x) ^ X, 

(c) (f)t o (f)s(x) — (f)t + s(x), 

for all t G M., X G A one-parameter family of diffeomorphisms satisfying these three 
properties is said to be a flow, and we say that the solutions of the vector field x — Ax 
define a flow on the phase space. 

9. Prove that 

V {A, Xa,'y) == Ker {A — Xcx,-^ , a — s,u,c, 'y — i, k, m, 

is an invariant (under A) subspace of Prove that it is also invariant under . 

10. Prove that Ph is a real vector space. 

11. Prove that E£ is a complex subspace of 

12. Prove that (-Ec)® — E, 

13. Suppose E C R”" is a subspace with basis 

{ei, . . . , Cfc} = e, 

and 

A : E ^ E, 

is a real linear map. 

(a) Prove that e is also a basis for Ec. 

(b) Prove that the matrix representation of A with respect to e is the same as the 
matrix representation of Ac with respect to e. 

14. Let be a real vector space and A a linear map of E into E. Show that 

(a) (Ker)j. = Ker (Ac), 

(b) (Im A)c = Im (Ac). 

(c) (A“^)^ = (Ac)“^, if A is invertible. 

In this exercise Ker denotes the kernel and Im denotes the image, or range. 

15. Suppose A : E E is a. real linear map. Show that A and Ac have the same 

characteristic polynomial. 

16. Prove that 

(A, ) 

is an invariant (under A) subspace of R”^. Prove that it is also invariant under e'^^ . 
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17. Prove that , and E^ , defined in (3.1.6), (3.1.7), and (3.1.8), respectively, are 

each invariant under . 

(a) Prove that trajectories with initial conditions in E^ approach zero at an expo- 
nential rate as t ^ oo. 

(b) Prove that trajectories with initial conditions in E^ approach zero at an expo- 
nential rate as t — oo. 

18. What can one say about trajectories with initial conditions in E‘^ under the following 
conditions on the eigenvalues: 

(a) there are no repeated eigenvalues, 

(b) there are no zero eigenvalues, 

(c) there are no zero eigenvalues, and no repeated eigenvalues. 

19. Suppose that E'^ — 0. Prove that the only trajectories starting in a neighborhood of 
the origin that remain in that neighborhood for all time are those that start on E^ . 

20. The Variation of Constants Formula. Consider the following inhomogeneous linear 
ordinary differential equation: 

X = Ax g{t), X G K"’, 



where A is an n X n-matrix of real numbers. Verify that the solution of this equation 
passing through xq at f = 0 is given by 

x{t) — e'^^ f e~'^^ g{s)ds c'^^xq. 

Jo 



Prove that this solution is unique. What conditions on g{t) are required for this unique 
solution to exist? 

21. Consider the block diagonal form of the linear vector field given in (3.2.5). Relate this 
block diagonal form of the vector field to the decomposition of the phase space as 

22. Consider the following linear vector fields on R^. 



f \ ^ f ^ A<0 

^ \X2 J \0 fl J \X2 J ’ /J->0' 

> \X2 J J J ’ A«<0- 

'i f \ — f ^ —uj\/xi\ X < 0 

^ \X2 J ~ yco X J yX2 J ' UJ > 0 

-> (2)- (s »<»■ 

•> (2)-(S J)(2)> »>»■ 

0 (2)-(S S)C0 



i. For each vector field compute all trajectories and illustrate them graphi- 
cally on the phase plane. Describe the stable, unstable, and center mani- 
folds of the origin. 

ii. For vector field a), discuss the cases |A| < /^, |A| = /x, and |A| > /x. What are 
the qualitative and quantitative differences in the dynamics for these three 
cases? Can the unstable manifold of the origin be considered an attracting 
set and/or an attractor and do the relative magnitudes of the eigenvalues 
affect these conclusions? 
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iii. For vector field b), discuss the cases X <. fi, X — fi, X > fi. What are 
the qualitative and quantitative differences in the dynamics for these three 
cases? Describe all zero- and one-dimensional invariant manifolds for this 
vector field. Describe the nature of the trajectories at the origin. In par- 
ticular, which trajectories are tangent to either the xi or X 2 axis? 

iv. In vector field c), describe how the trajectories depend on the relative 
magnitudes of A and co. What happens when A = 0? When u; = 0? 

V. Describe the effect of linear perturbations on each of the vector fields, 
vi. Describe the effect near the origin of nonlinear perturbations on each of 
the vector fields. Can you say anything about the effects of nonlinear per- 
turbations on the dynamics outside of a neighborhood of the origin? 

We remark that vi) is a difficult problem for the nonhyperbolic fixed points. We will 
study this situation in great detail when we develop center manifold theory and bifur- 
cation theory. 

23. Give a characterization of the stable, unstable, and center subspaces for linear maps in 
terms of generalized eigenspaces along the same lines as we did for linear vector fields 
according to the formulae (3.1.6), (3.1.7), and (3.1.8). 

24. For the following linear vector fields find the general solution, and compute the stable, 
unstable, and center subspaces and plot them in the phase space. 



a) 

b) 

c) 

d) 

e) 

f) 

g) 

b) 



f xi 
V X2 



10 1 
0 0-2 
0 1 0 

0 0 15 

1 0 -17 

0 1 7 



0 0 1 
0 12 
0 3 2 



3 0 0\ f XI 

0 2 -5 X2 

0 1 - 2 / 

1 -3 3\ / xi 

3-5 3 \ X2 

6—6 4/ \X3 

-3 1 -l\ / XI 
-7 5-1 I X2 

-6 6 -2 J \ a :3 

10 0\ / XI 

12 0 ] \ X2 

1 0 -1 / V2^3 



25. Consider the following linear maps on ] 



b) 

c) 

d) 



/ xi 

V X2 

f Xi_ 

V X2 



X 0 

0 /J. 

A 0 

0 /j. 



1 0 
0 A 



A —uj \ f x\ 
UJ X ) \X2 



f XX 
V X2 



|A| < 1 

ImI > 1 ■ 

|A| < 1 

ImI < 1 ■ 

a; > 0. 



|A| < 1. 
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i. For each map compute all the orbits and illustrate them graphically on 
the phase plane. Describe the stable, unstable, and center manifolds of the 
origin. 

ii. For map a), discuss the cases A, /^ > 0; A = 0, /x > 0; A, p. < 0; and A < 0, 
/X > 0. What are the qualitative differences in the dynamics for these four 
cases? Discuss how the orbits depend on the relative magnitudes of the 
eigenvalues. Discuss the attracting nature of the unstable manifold of the 
origin and its dependence on the relative magnitudes of the eigenvalues. 

iii. For map b), discuss the cases A, ^ > 0; A = 0, /x > 0; A, /x < 0; and A < 0, 
/X > 0. What are the qualitative differences in the dynamics for these four 
cases? Describe all zero- and one-dimensional invariant manifolds for this 
map. Do all orbits lie on invariant manifolds? 

iv. For map c), consider the cases A^ -|- < 1, A^ -|- uP" > 1, and A -|- feu = 

for OL rational and a irrational. Describe the qualitative differences in the 
dynamics for these four cases. 

V. Describe the effect of linear perturbations on each of the maps. 

vi. Describe the effect near the origin of nonlinear perturbations on each of the 
maps. Can you say anything about the effects of nonlinear perturbations 
on the dynamics outside of a neighborhood of the origin? 

We remark that vi) is very difficult for nonhyperbolic fixed points (more so than the 
analogous case for vector fields in the previous exercise) and will be treated in great 
detail when we develop center manifold theory and bifurcation theory. 

26. Consider the following vector fields. 



a) 


X = 


y^ 


(x. v'] F R^. 






y = 


— 5y — /xa:, 








b) 


X — 

y = 


y^ 

— 6y — fix — 




{x, y) G 




c) 


X — 


y^ 


3 


(x, y) G 






y = 


— 6y — fix — 


X , 




d) 


X — 

y ^ 


—5x -fiy-\- 
fix - 6y + ^ 


xy, 


-v^), 


1 G R^. 


e) 


X — 


— X x^, 


{x, 


,y) G rL 






y = 


x + y, 






f) 


r — 
0 - 


r(l — r^), 
cos 40, 


(r 


, 0) G R’b X 




g) 


r — 
0 = 


r{6 4- fJ^P — 




(r, 0) G M’*' 


X sL 


h) 


0 = 
i) — 


V, 

— sin 0 — 6v 


+ 


{0,v) G 


X R. 


i) 


0^ . 


- ^1, 


IV 


{01 ■, 02) ^ 


X s' 




02 - 


- C02 + 01 ^ n 






j) 




- 02 — sin 0\ , 

- —02-, 




(01,02) G s" X 


sL 



k) (^1,^2) G S" X SL 

U2 — ^2 5 

Describe the nature of the stable and unstable manifolds of the fixed points by draw- 
ing phase portraits. Can you determine anything about the global behavior of the 
manifolds? 

In a), b), c), d), g), and h) consider the cases <5 < 0, <5 = 0, <5 > 0, /x < 0, /x = 0, and 
/X > 0. In i) and k) consider cui > 0 and cu 2 > 0. 
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FIGURE 3.8.1. 



27. Consider the following maps. 



a) 

b) 



c) 

d) 



e) 



f) 



h) 



X X, 

y ^ X -\-y, 


(x, y) e R^. 


X x^ , 

y ^ X -\-y, 


{x,y) G R^ 


^ Oi, 


G S" X SF 


9i 1 -^ sin 9i , 
9<2 ^ 9\, 


(9i,e^) G S" X sF 


(^) 


1/2 (a:, y) G R^. 


a; M. 

V >->■ (a:y)^''^ 


{x,y) G R^. 


X \-¥ yi — 5y ■ 
y X, 


(a:.y)GRU 


9 9 V, /'o \ ^ 

V Sv — fj,cos{9 + v), ^ 



Describe the nature of the stable and unstable manifolds of the fixed points by drawing 
phase portraits. Can you determine any global behavior? Describe the nature of the 
In g) and h) consider the cases (5 < 0, (5 = 0, 5 > 0, /j- < 0, /j- = 0, and ^ > 0. 



28. Consider the vector field 



X — X, 

V = -V, (x, v) G R^. 



(see Fig. 3.8.1). 

The origin is a hyperbolic fixed point with stable and unstable manifolds given by 
VK'’(0, 0) = |(a:, y) I a; = o}, VK“(0,0) = { (a;, y) | y = o} . 

Let 

Us — y) I |y — yol < o < x < e, for some e > o|, 

Uu — |(3;5y)||a:— a;o| 0<y<e, for some e > 0 j- ; 
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Show that you can find smaller closed sets Us C Us, Uu C Uu, such that Us maps 
onto Uu under the time T flow map (T must be chosen carefully; it depends on the 
size of the Us, Uu) cmd such that horizontal and vertical boundaries of Ug correspond 
to horizontal and vertical boundaries of Uu- How would this problem be formulated 
and solved for maps? 

(Note: This seemingly silly exercise is important for the understanding of chaotic in- 
variant sets. We will use it later when we study the orbit structure near homoclinic 
orbits.) 

29. Consider the vector field 

I 2 

X — —X y , 

y — —2x'^ (x, y) G 

(a) Show that y — x"^ is an invariant manifold. 

(b) Show that there is a trajectory connecting the equilibrium points (0, 0) and 

(HI)- 

(c) Is y — x^ the center manifold of the origin? 

30. Consider the vector field 

X — —X, 

y — 2y — 5x^ , {x, y) G M.^. 

(a) Show that y — x^ is an invariant manifold. 

(b) Determine the global stable and unstable manifolds of the origin. 

31. Consider a map 

X ^ f{x, y), 

y^g{x,y), (x,y) xR^. 

Suppose the graph oi y = h{x) is an invariant manifold for this map. Derive the “tan- 
gency condition” for discrete time systems that is analogous to (3.2.7) for continuous 
time systems. How differentiable must the map be for the condition to hold? 

32. Consider the map 

, 2 
X y ^ 

y i—y 2y — 2x^ -1- x'^y’^ , (x, y) G 

(a) Is the map a diffeomorphism? 

(b) Show that y — x"^ is an invariant manifold. 

(c) Show that (0,0) and (1, 1) are fixed points. 

(d) Does there exist heteroclinic orbits between these fixed points? 

33. Consider the graph transform proof of the existence of the local unstable manifold of 
the origin for (3.5.1). 

(a) Show that Ss is a complete metric space. 

(b) The graph of h{x), |a:| < e, was shown to be locally invariant for a fixed f > 0. 
Show that it is locally invariant for t' ^ t. 

(c) Show that trajectories starting on the graph of h{x), |ai| < e decay to the origin 
at an exponential rate as f ^ — oo. 

(d) Show that the global unstable manifold defined in (3.5.13) is invariant and one 
dimensional. 

(e) Show that h{x)., |a:| < e is . 
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34. Consider the Liapunov-Perron proof of the existence of the local unstable manifold of 
the origin for (3.5.1). 

(a) Show that S§ is a complete metric space. 

(b) Show that if h{xo) satisfies (3.5.20), then the graph of h{x) is invariant. 

(c) Show that trajectories starting on the graph of h{x), |rr| < e decay to the origin 
at an exponential rate as t ^ — oo. 

(d) Show that unstable manifold found by this method is . 

35. Show that the conditions (3.6.3) and (3.6.4) in the definition of exponential dichotomy 
can be written in the equivalent form 

II X (t)Px II < K'^ exp ( — Ai(t — r)) || X {r)Px || , t ^ 

II X{t) (id — P) a: II < K '2 exp (A 2 (t — r)) || X(r) (id — P) a: ||, t < r, 

\\ X{t)PX~^{t) \\ < K'^, forallt, 



for constants K' 2 , K'^ > 0 and any x. 

Show that the stable subspace E^{t) is given by X{r)Px for all x at time r, and 
that the first of these conditions describes its evolution for t ~> t. Similarly, show 
that the second condition implies that there is a time-dependent, subspace (P^(r)) of 
solutions that decay to zero at an exponential rate as t ^ —00 and that it is given 
by X(r)(id — P)x for all x at time r, and the second condition describes its evolution 
for t <. T. Show that the third condition implies that the angle between these two 
subspaces stays bounded away from zero for all time. 

36. In Example 3.6.2 compute and plot the time dependent stable and unstable subspaces 
in the extended phase space. Do they have any periodicity (in time) properties? 

37. Consider the following linear, time dependent vector field: 

X — —X t, 

y = y-x, (£C,y)eR^. (3.8.1) 

(a) Show that the general solution through an arbitrary point (£Co,yo) at f = 0 is 
given by 

x(t) = t — 1 + e“* (xo + 1) , 

y{t) = t + e* (i/o + 2) + i (e“* - (xq + 1) . 

(b) Show that the trajectory {x{t),y{t)) — (t — l,t) has an exponential dichotomy. 

(c) Compute and plot the stable and unstable manifolds of the trajectory 
(x{t),y{t)) — [t — l,t) in the extended phase space. 

38. Consider the following vector field: 

X — X, 

y = -y + x'^ Qd(t) + a(t)^ , 



where a{t) is an arbitrary time dependent function. 

(a) Show that the origin is a hyperbolic trajectory. 

(b) Argue that the graph oi y — is the global unstable manifold of the origin. 

What requirements must be made on the function a{t) in order that these conclusions 
are true? 
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39. Consider the following modification of (3.5.1): 



X — a(t)x^ 
V = 



(3.8.2) 



where 

a{t), b(t) > A > 0,Vi, 



where A is a constant. 

(a) Is the origin a trajectory of (3.8.2)? 

(b) Consider (3.8.2) linearized about the origin. Discuss the stable and unstable 
subspaces associated with the origin and the nature of trajectories in these sub- 
spaces. 

(c) Use the Liapunov-Perron method to show that the origin has a (time dependent) 
unstable manifold. 

(d) Use the graph transform method to show that the origin has a time dependent 
unstable manifold. 
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Periodic Orbits 



In this chapter we will describe a very important type of orbit for vector 
fields and maps: periodic orbits. In a certain sense, periodic orbits are 
the only types of orbits that we can ever hope to understand completely 
throughout their evolution from the distant past (i.e. as t — >■ — oo) to the 
distant future (i.e., as t ^ oo) since the entire course of their evolution is 
determined by knowledge over a finite time interval, i.e., the period^. For 
this reason it is very tempting, and often quite advantageous, to try to 
understand the key features of a dynamical system in terms of the periodic 
orbits. We will discuss this a bit more at the end of this chapter. But now 
we begin with a definition. 

We consider vector fields 

x = f{x,t), cc G K", (4.0.1) 

and maps 

X i-G g{x), X G K". (4.0.2) 

Definition 4.0.1 (Periodic Orbits) (Vector Fields^ A solution of 
(f.0.1) through the point xq is said to be periodic of period T if there 
exists T > 0 such that x(f,to) = x(f + T,xo) for all t G K. (WIaps^ The 
orbit of xo G K" is said to be periodic of period k > 0 if g'^{xo) = xq . 

We remark that if a solution of (4.0.1) is periodic of period T then 
evidently it is periodic of period nT for any integer n > 1. However, by the 
period of an orbit we mean the smallest possible T > 0 such that Definition 
4.0.1 holds. A similar statement holds for periodic orbits of maps. 



^This is not a completely accurate statement. One could make the same ar- 
gument for fixed points, and orbits homoclinic or heteroclinic to fixed points and 
periodic orbits. However, one could argue that a fixed point is a particularly 
simple type of periodic orbit. 
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4.1 Nonexistence of Periodic Orbits for 

Two-Dimensional, Autonomous Vector Fields 

Now we will learn a useful and easily applicable trick for establishing the 
nonexistence of periodic solutions of autonomous vector fields on the plane. 
We will denote these vector fields by 

i = f{x,y), 

y = g{x,y), (x,y) (4.1.1) 

where / and g are at least C^. 

Theorem 4.1.1 (Bendixson’s criterion) If on a simply connected re- 
gion £) C (i.e., D has no holes in it) the expression gy + not 

identically zero and does not change sign, then (4.1.1) has no closed orbits 
lying entirely in D. 

Proof: This is a simple result of Green’s theorem on the plane; see Abraham, 
Marsden, and Ratiu [1988]. Using (4.1.1) and applying the chain rule we 
find that on any closed orbit T we have 

J f dy - g dx = 0. (4.1.2) 

By Green’s theorem this implies 

where S is the interior bounded by F. But if fy + 7^ 0 and doesn’t 

change sign, then this obviously can’t be true. Therefore, there must be no 
closed orbits in D. □ 

A generalization of Bendixson’s criterion due to Dulac is the following. 

Theorem 4.1.2 Let B{x,y) be on a simply connected region D C 
F 4" identically zero and does not change sign in D, then 

(4.1.1) has no closed orbits lying entirely in D. 

Proof: The proof is very similar to the previous theorem so we omit it and 
leave it as an exercise. □ 

Example 4.1.1 (Application to the Unforced Duffing Oscillator). 

Consider the vector held 

x = V = f{x,y), 

y = x-x^-5y = g{x, y), <5 > 0. (4.1.4) 
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An easy calculation shows that 

^ ^ = -ri 

dx dy 

Thus, for (5 > 0, (4.1.4) has no closed orbits. We will answer the question of what 
happens when 5 = 0 in Chapter 5. 



End of Example 4.1.1 

The next example shows how Theorem 4.1.1 allows us to restrict regions 
in the plane where closed orbits might exist. 

Example 4.1.2. Consider the following modification of the unforced Duffing 
oscillator 

x = y = f{x,y), 

y = X — — 6y + x^y = g{x,y), 5 > 0. (4.1.5) 

This equation has three fixed points at (x,y) = (0,0), (±1,0) with the eigenval- 
ues, Ai, 2 , of the associated linearization about each fixed point given by 




FIGURE 4.1.1. The regions defined by a; = (the figure is drawn for S > 1). 



(0,0)^Ai,2 = ^±i\/52±4, (4.1.6) 

(1, 0) ^ Ai,2 = ± IVi-S + 1)2 - 8, (4.1.7) 

(-1, 0) ^ Ai,2 = ± IVi-S + 1)2 - 8. (4.1.8) 



Thus, (0, 0) is a saddle, and (±1, 0) are sinks for 5 > 1 and sources for 0 < <5 < 1. 
A simple calculation gives 



dx dy 



—S + x^ . 



(4.1.9) 
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Thus, (4.1.9) vanishes on the lines x = ±'/5. These two lines divide the plane 
into three disjoint regions which we label (from left to right) R\, R 2 , and as 
shown in Figure 4.1.1. 

Now from Theorem 4.1.1, we can immediately conclude that (4.1.5) can have 
no closed orbits lying entirely in either region R\, R 2 , or R 3 . However, we cannot 
rule out the existence of closed orbits which overlap these regions as shown in 
Figure 4.1.2. When we discuss index theory in Chapter 6 we will see how to 
reduce the number of possibilities even further. We finally remark that it is not 
a coincidence that the lines x = ±'/5 fall on the fixed points (±1,0) when the 
real parts of the eigenvalues of these fixed points vanish. We will learn what is 
going on in this case when we study the Poincare- Andronov-Hopf bifurcation. 



End of Example 4.1.2 
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FIGURE 4.1.2. Some possibilities for the existence of close orbits in (4.1.4). a-c 
apply to the case for 5 > 1, d-f to 0 < <5 < 1. 



4.2 Further Remarks on Periodic Orbits 

Below we describe a number of situations where periodic orbits are impor- 
tant. 

Closing Lemmas: A “closing lemma” is a theorem that describes how 
“close” an orbit is to a periodic orbit. Hamiltonian systems and 
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chaotic systems tend to possess periodic orbits in abundance, and 
it is often possible to characterize “most” orbits in terms of periodic 
orbits. ^ One of the first closing lemmas is due to Pugh [1967]. Robin- 
son [1978] provides an excellent introduction to the closing lemma. 
Recent references that describe the state of the art, as well as contain 
references to the previous work are Mane [1982], Hayashi [1997], and 
Aranson et al. [1997]. However, a word of caution is in order, Herman 
[1991] has constructed examples of Hamiltonian systems that con- 
tain no periodic orbits. Moreover, this situation is structurally stable 
in the sense that small perturbations of Herman’s examples also do 
not contain periodic orbits. This should be an indication of just how 
subtle these questions may be. 

Dynamical Averaging: It is often more meaningful to characterize sys- 
tems possessing complex dynamics through certain quantities involv- 
ing asymptotic time averages of trajectories. Examples of such quan- 
tities are power spectra, generalized dimensions, Liapunov exponents, 
and Kolmogorov entropy. Cvitanovic [1995] shows how under certain 
conditions such quantities can be calculated in terms of averages of 
periodic orbits. 

Characterization of Chaos: Periodic orbits play a key role in under- 
standing the structure of chaotic attractors. See Lai et al. [1997], 
Zoldi and Greenside [1998], and the references in these papers. 

Semiclassical Mechanics: Periodic orbits play an important role as a 
bridge between classical and quantum mechanics. Unstable periodic 
orbits in classical systems may give rise to an extra, and unexpected, 
eigenstate density in their vicinity, which is referred to as a “scar” 
in the corresponding quantum mechanical system. See Kaplan and 
Heller [1999], and references therein. 

Mathematical Biology: Periodic orbits are playing an increasingly im- 
portant role in the understanding of certain biological phenomena. 
See, e.g.. So. et al. [1998], Lesher, et al. [1999], and the references in 
these papers. 

Time Series Analysis: Periodic orbits play a key role in distinguishing 
the difference between complicated time series generated by deter- 
ministic chaos, versus those generated by random noise. See Carroll 
[1999], and references therein. 



^Proving closing lemmas for specific classes of dynamical systems requires 
precise hypotheses and rather technical arguments. The quotation marks around 
the words in this paragraph indicate that making these words mathematically 
precise is very important. 
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4.3 Exercises 

1. Show that the following vector field on the cylinder 

i) = — r?, 

e = 1, (v,e) G X 

has a periodic orbit. Explain why Bendixson’s criterion does not hold. 

2. Construct a vector field in with negative divergence that possesses a periodic orbit. 
Construct a vector field in with negative divergence that contains a continuous 
family of periodic orbits. 

3. Prove Theorem 4.1.2. 

4. The definition of periodic orbit for vector fields given in Definition 4.0.1 was given 
for the nonautonomous vector field (4.0.1). Suppose (4.0.1) is not periodic in time. 
That is, there is no constant T > 0 such that f{x, t) — f{x, t + T) for all t, x in the 
domain of definition. We refer to such nonautonomous vector fields as aperiodically 
time- dependent. Can aperiodically time dependent vector fields have periodic orbits? 
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Vector Fields Possessing an 
Integral 



For a general vector field 

X = f{x), X € M", 

a scalar valued function I{x) is said to be an integral (sometimes the term 
first integral is used) if it is constant on trajectories, i.e., 

i{x) = VI{x) ■ X = VI{x) ■ f{x) = 0, 

where “ • ” denotes the usual Euclidean inner product. From this relation 
we see that the level sets of I{x) (which are generally (n— 1 (-dimensional) 
are invariant sets. For two-dimensional vector fields the level sets actually 
give the trajectories of the system. We examine this case in more detail. 



5.1 Vector Fields on Two-Manifolds Having an 
Integral 

In applications, three types of two-dimensional phase spaces occur fre- 
quently; they are (1) the plane, x (2) the cylinder, x 

and (3) the two-torus, x . The vector field can be written as 

X = f{x,y), . . 

y = g{x,y), 

where / and g are C’’ (r > 1), and as {x,y) G x for a vector field 
on the plane, as (x, y) G x for a vector field on the cylinder, and as 
(x, y) £ X for a vector field on the torus, where denotes the circle 
(which is sometimes referred to as a 1-torus, T^). We now want to give 
some examples of how these different phase spaces arise and at the same 
time go into more detail concering the idea of an integrable vector field. We 
begin with the unforced Duffing oscillator. 

Example 5.1.1 (The Unforced Duffing Oscillator). We have been slowly dis- 
covering the global structure of the phase space of the unforced Duffing oscillator 
given by 



X — X + Sx + = 0, 



(5.1.2) 
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or, written as a system, 

5>0. (5.1.3) 

Thus far we know the local structure near the three fixed points {x, y) = (0, 0), 
(±1, 0) and that for 5 > 0 there are no closed orbits. The next step is to under- 
stand the geometry of the global orbit structure. In general, this is a formidable 
task. However, for the special parameter value 5 = 0, we can understand com- 
pletely the global geometry, which, we will see, provides a framework for under- 
standing the global geometry for 5 ^ Q. 




FIGURE 5.1.1. Graph of V{x). 

The reason we can do this is that, for <5 = 0, the unforced, undamped Duff- 
ing oscillator has a first integral or a function of the dependent variables whose 
level curves give the orbits. Alternately, in more physical terms, the unforced, 
undamped Duffing oscillator is a conservative system having an energy function 
which is constant on orbits. This can be seen as follows — take the unforced, un- 
damped Duffing oscillator, multiply it by x, and integrate as below. 

XX — XX + xx^ = 0 



or 



hence, 



or 



d 

dt 




= 0 ; 





= h — constant 




(5.1.4) 



(5.1.5) 



This is a first integral for the unforced, undamped Duffing oscillator or, if 
you think of y^/2 as the kinetic energy (mass has been scaled to be 1) and 
— a;^/2-|-x^/4 = V{x) as potential energy, h can be thought of as the total energy 
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V(x) 




FIGURE 5.1.2. 

of the system. Therefore, the level curves of this function give the global structure 
of the phase space. 



End of Example 5.1.1 

In general, for one-degree-of- freedom problems (i.e., vector fields on a 
two-dimensional phase space) that have a first integral that can be viewed 
as the sum of a kinetic and potential energy, there is an easy, graphical 
method for drawing the phase space. 

We will illustrate the method for the unforced, undamped Duffing oscillator. 
As a preliminary step, we point out the shape of the graph of V (x) in Figure 
5.1.1. 

Now suppose that the first integral is given by 

h= — + V{x)-, 



then 

y = ±V2sJh-V{x). (5.1.6) 

Our goal is to draw the level sets of h. Imagine sitting at the point 
(0,0), with h fixed. Now move toward the right (i.e., let x increase). A 
glance at the graph of V (x) shows that V begins to decrease. Then, since 
y = +V^ \/h — V{x) (we take the -I- sign for the moment) and h is fixed, 
y must increase until the minimum of the potential is reached, and then 
it decreases until the boundary of the potential is reached (why can’t you 




80 



5. Vector Fields Possessing an Integral 




FIGURE 5.1.3. 
y 




FIGURE 5.1.4. 

go farther?) (see Figure 5.1.2). Now y = + or — y/2 — U(x); hence the 

entire orbit through (0,0) for fixed h is as in Figure 5.1.3. 

(Note: why are the arrows drawn in their particular directions in Figure 
5.1.3?) By symmetry, there is another homoclinic orbit to the left as in 
Figure 5.1.4, and if you repeat this procedure for different points you can 
draw the entire phase plane as shown in Figure 5.1.5. 

The homoclinic orbit is sometimes called a separatrix because it is the 
boundary between two distinctly different types of motions. We will study 
homoclinic orbits in some detail later on. 



y 




FIGURE 5.1.5. Orbits of the unforced, undamped Duffing oscillator. 
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FIGURE 5.1.6. Fixed points of the pendulum. 

Denoting the first integral of the unforced, undamped Duffing oscillator 
by h was meant to be suggestive. The unforced, undamped Duffing oscilla- 
tor is actually a Hamiltonian System, i.e., there exists a function h = h{x, y) 
such that the vector field is given by 




FIGURE 5.1.7. a) Orbits of the pendulum on with <j> = ±7t identified, b) 
Orbits of the pendulum on the cyliner. 
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K ( 5 - 1 - 7 ) 



(we will study these in more detail later). Note that all the solutions 
lie on level curves of h which are topologically the same as (or T^). 
This Hamiltonian system is an integrable Hamiltonian system and it has 
a characteristic of all n-degree-of-freedom integrable Hamiltonian systems 
in that its bounded motions lie on n-dimensional tori or homoclinic and 
heteroclinic orbits (see Arnold [1978] or Abraham and Marsden [1978]). 
(Note that all one-degree-of-freedom Hamiltonian systems are integrable.) 
More information on Hamiltonian vector fields can be found in Chapters 
13 and 14. 



Example 5.1.2 (The Pendulum). The equation of motion of a simple pendulum 
(again, all physical constants are scaled out) is given by 

(() + sin</) = 0 (5.1.8) 



or, written as a system, 

= e xR\ (5.1.9) 

■i) = — sin (j>, 

This equation has fixed points at (0,0), (±7t,0), and simple calculations 
show that (0, 0) is a center (i.e., the eigenvalues are purely imaginary) and (Ttt, 0) 
are saddles, but since the phase space is the cylinder and not the plane, (±7t, 0) 
are really the same point (see Figure 5.1.6). (Think of the pendulum as a physical 
object and you will see that this is obvious.) 

Now, just as in Example 5.1.1, the pendulum is a Hamiltonian system with a 
first integral given by 

/i = -^ — coa4>- (5.1.10) 

Again, as in Example 5.1.1, this fact allows the global phase portrait for the 
pendulum to be drawn, as shown in Figure 5.1.7a. Alternatively, by gluing the 
two lines (j> = irr together, we obtain the orbits on the cylinder as shown in 
Figure 5.1.7b. 



End of Example 5.1.2 



5.2 Two Degree-of-Freedom Hamiltonian Systems 
and Geometry 

We now give an example of a two degree- of- freedom Hamiltonian system 
that very concretely illustrates a number of more advanced concepts that 
we will discuss later on. 
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Consider two linearly coupled Harmonic oscillators that we write as a 
system as follows 



dH 

xi = yi = 

oyi 

yi = - c(xi - X2) 

dH 

X2 = y2= 

OV 2 

y2 = -UJ^X2 - c(x2 - Xi) 



dH 
dxi ’ 



dH 
dx2 ’ 



where the Hamiltonian, H is given by, 



H{xi,yi,X2,y2) 



2 2 2 2 
m ^ ^1 , ^ 

2 2 2 



2 2 
u X 2 






This is a linear system which can be transformed into real Jordan canonical 
form as follows 



Ui=Vi = 



dH 
dvi ’ 



hi = — = — 



U2 = V2 = 



dH 
dv2 ’ 



V2 = — H2M2 = — 



dH 
dui ’ 



dH 
du2 ’ 



where 

rii = w, ^2 = \/ up- + 2c, 



and 






flfui 

2 



2 



(Note that using the same symbol “H” for the Hamiltonian in two different 
sets of coordinates is not particularly good, and can lead to confusion.) Now 
one clearly sees that the level sets of the Hamiltonian are three-spheres, 
S^. Thus the four dimensional phase space is foliated by a one-parameter 
family of invariant three-spheres. Next we will study the dynamics on these 
three-spheres. 



5.2a Dynamics on the Energy Surface 

First we transform to polar coordinates 
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I i • ■ / 

Y + J—9^cos0i = \/2IiVtiCOS0i, 



\j 2Ii^i COS 0i , 

21 , 



Vi = \ %-i,cos0i - ij2I,Q.i0, sin 0, = sin 6»i, 



Vli 



= 1,2. 



Combining these relations with the equations of motion in the ui — f i — 
U 2 — V 2 coordinates gives 

= 0 

0i = fii, i = 1, 2. 

Full Equations of Motion 

The full equations of motion are given as 

01 = fli = If, 



A = 0 = - 



dH 
d9i ’ 



02 = ^2 



dH 
dh ’ 



with Hamiltonian 



A = 0 = - 



dH 
d02 ’ 



H{Il, I 2 ) — Il^l + l2^2- 



Equations of Motion Restricted to the Energy Surface 

Since trajectories are restricted to lie in the three dimensional energy sur- 
face the dynamics is really three dimensional. Now we show how this can 
be realized in the equations of motion. 

The equation for the energy surface is 

H{h,l 2 ) = + 12^2 = h = constant, 



which we can easily rearrange and explicitly exhibit I 2 as a function of Ii 
and h 



I 9 — 



h — /ifli 

n*? 



= l2{h,h). 



Hence, if we know Ii and h, we know l 2 - Therefore, on the energy surface, 
the equations reduce to 



01 — 111 , 
02 = H2, 

/i = 0. 



(5.2.1) 
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From these equations we see that the energy surface is foliated by a one- 
parameter family of invariant two-tori. 



5.2b Dynamics on an Individual Torus 

Fixing h and Ii chooses an individual two-torus. The trajectories on this 
torus are given by 

Oi{i) = riit 

= ^2t + H20- 

The nature of the trajectories depend on the number If this is a ratio- 
nal number then all trajectories are periodic. If it is irrational, then any 
trajectory densely covers the torus. These statements are proved in Section 
10.4a. 

For a more detailed study of the geometry of two coupled linear oscilla- 
tors, which makes connections with many deeper topics such as the Hopf 
fibration of the three-sphere and knots, see the excellent paper of Meyer 
[1990]. 

5.3 Exercises 



1 . Consider the following vector fields. 



a) X + fix = 0, ic G 

b) X + fix + — 0, X ^ M.^, 

c) X + fix + — 0, X ^ M.^, 



d) 



X = —fiy + xy, 
y = fix + ^(x'^ - y'^ 



{x, y) e R^. 



i) Write a), b) and c) as systems. 

ii) Find and determine the nature of the stability of the fixed points, 

iii) Find the first integrals and draw all phase curves for fi <. 0 , fi — 0 , and 
fi > 0. 



2 . Euler’s equations of motion for a free rigid body are 



rrii — liiOi, 


i — \ 


I2 - I3 




mi = 


m2m3 






1 

CO 




m 2 — — — — 


mim3 


^1^3 




h - I2 




m 3 = ^ ^ 


mim2 


^2 





, 3 , Ii > I2 > I3 



(mi , m2, m3) G 



a) Find and determine the nature of the stability of the fixed points. 

b) Show that the functions 



H{mi,m2,m3) — - 



A I2 I3 



L(mi , m2 , m3) — m^ + m.2 + m^ 



are constants on orbits. 
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c) For fixed L, draw all phase curves. 

3. At the beginning of this chapter we defined the notion of an integral for autonomous 
vector fields. An integral is a (scalar valued) function that is constant on trajectories, 
or more geometrically, it is a function having the property that the vector field is 
tangent to its level sets. Generalize these ideas for the case of nonautonomous vector 
fields. 

4. Consider the a map: 

X I— g{x) , X G , 



or 

Xn + l = g{Xn). 

Define the notion of an integral for maps. 
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Before we describe some of the uses of index theory, we will give a heuristic 
description of the idea. 

Suppose we have a vector field defined in some simply connected region, 
7?,,of the plane (this is a two-dimensional method only). Let L be any closed 
loop in TZ which contains no fixed points of the vector field. You can imagine 
at each point, p, on the loop L that there is an arrow representing the value 
of the vector field at p (see Figure 6.0.1). 




FIGURE 6.0.1. Vector field on the closed curve F. 

Now as you move around F in the counter-clockwise sense (call this the 
positive direction), the vectors on F rotate, and when you get back to the 
point at which you started, they will have rotated through an angle 27rfc, 
where k is some integer. This integer, fc, is called the index o/F. 

The index of a closed curve containing no fixed points can be calculated 
by integrating the change in the angle of the vectors at each point on F 
around F (this angle is measured with respect to some chosen coordinate 
system) . For a vector field defined on some simply connected region, TZ, of 
the plane given by 



i = f{x,y), 

y = 9{x,y), {x,y) gTZ 



( 6 . 0 . 1 ) 
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the index of F, k, is found by computing 

-u 

-hi 

This integral has several properties, one of the most important being 
that it retains the same value if F is smoothly deformed, as long as it is not 
deformed through some fixed point of the vector field. The index of a fixed 
point is defined to be the index of a closed curve which contains only this 
one fixed point, and where no fixed points are on the closed curve. From 
the definition of the index given above (if not by just drawing pictures), 
one can prove the following theorems. 

Theorem 6.0.1 i) The index of a sink, a source, or a center is +1. 

ii) The index of a hyperbolic saddle point is —1. 

iii) The index of a periodic orbit is +1. 

iv) The index of a closed curve not containing any fixed points is 0. 

v) The index of a closed curve is equal to the sum of the indices of the 
fixed points within it. 

An immediate corollary of this is the following. 

Corollary 6.0.2 Inside any periodic orbit 7 there must be at least one 
fixed point. If there is only one, then it must be a sink, source, or center. 
If all the fixed points within 7 are hyperbolic, then there must be an odd 
number, 2n + 1, of which n are saddles and n + 1 either sinks or sources. 

For more information on index theory, see Andronov et al. [1971]. 

A homoclinic orbit should not should not be treated as a periodic orbit 
for the purpose of the application of index theory. In fact, there are 
examples of vector fields on the two-sphere containing homoclinic orbits 
which in turn do not surronnd an equilibrium point. In particular, in 
Guillemin and Pollack [1974] there is a phase portrait of a vector held 
on the two sphere containing one saddle- type equlibrium point with all 
orbits homoclinic to this saddle point. 

Example 4.1.2 Revisited Using the above results, the reader should be 
able to verify that the phase portraits shown in Figures 5.1.2b and 5.1.2e 
cannot occur. This example shows how Bendixson and Dulac’s criteria used 
with index theory can go a long way toward describing the global struc- 
ture of phase portraits on the plane. We remark that a higher dimensional 
generalization of index theory is degree theory. For an introduction to the 
use of degree theory in dynamical systems and bifurcation theory we refer 
the reader to Chow and Hale [1982] or Smoller [1983]. 
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Exercises 



There are six phase portraits of vector fields on the plane shown in Figure 6.1.1. 
Using various phase plane techniques, determine which phase portraits are correct and 
which are incorrect. Modify the incorrect phase portraits to make them correct, not 
by deleting any orbits shown but by changing the stability types of existing orbits or 
adding new orbits. 




FIGURE 6.1.1. 
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Some General Properties of 
Vector Fields: Existence, 
Uniqueness, Differentiability, 
and Flows 



In this section we want to give some of the basic theorems describing general 
properties of solutions of vector fields. Since it is just as easy to treat the 
nonautonomous case we will do so. 

Consider the vector field 



where f{x,t) is C’’, r > 1, on some open set U C M" x 



(7.0.1) 



7.1 Existence, Uniqueness, Differentiability with 
Respect to Initial Conditions 

Theorem 7.1.1 Let (xopo) G U. Then there exists a solution of (7.0.1) 
through the point xq att = to, denoted x{t, to, xo) with x(to, to, xo) = xo, for 
\t — to\ sufficiently small. This solution is unique in the sense that any other 
solution of (7.0.1) through xo at t = to must be the same as x(t,to,xo) on 
their common interval of existence. Moreover, x{t,to,xo) is a C’’ function 
of t, to, and xo. 

Proof: See Arnold [1973], Hirsch and Smale [1974], or Hale [1980]. □ 



We remark that it is possible to weaken the assumptions on f{x, t) and 
still obtain existence and uniqueness. We refer the reader to Hale [1980] for 
a discussion. 

Theorem 7.1.1 only guarantees existence and uniqueness for sufficiently 
small time intervals. The following result allows us to uniquely extend the 
time interval of existence. 
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7.2 Continuation of Solutions 

Let C CU C M" X be a compact set containing (xo,to)- 

Theorem 7.2.1 The solution x{t,to,xo) can he uniquely extended back- 
ward and forward in t up to the boundary of C. 

Proof: See Hale [1980]. □ 

Theorem 7.2.1 tells us how solutions fail to exist; namely, they “blow 
up.” Consider the following example. 

Example 7.2.1. Consider the equation 

X = , a; € (7.2.1) 

The solution of (7.2.1) through xo at t = 0 is given by 

x(t,0,xo)^ (7.2.2) 

It should be clear that (7.2.2) does not exist for all time, since it becomes infinite 
at t = 1/xo- This example also shows that the time interval of existence may 
depend on xq. 



End of Example 7.2.1 

In practice we often encounter vector fields depending on parameters, 
and it is often necessary to differentiate the solutions with respect to the 
parameters. The following result covers this situation. 



7.3 Differentiability with Respect to Parameters 

Consider the vector field 

x = f{x,t;p.), (7.3.1) 

where f{x, t; p) is C” (r > 1) on some open set U C K" x x 

Theorem 7.3.1 For (to,xo, p) € U the solution x{t,tQ,xo,p) is a func- 
tion oft, to, xq, and p. 

Proof: See Arnold [1973] or Hale [1980]. □ 

At this stage we would like to point out some special properties of C”, 
r > 1, autonomous vector fields which will prove useful. 
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7.4 Autonomous Vector Fields 

Consider the vector field 



i = f{x), xeM", (7.4.1) 

where f{x) is C’’, r > 1, on some open set U G K". For simplicity, let us 
suppose that the solutions exist for all time (we leave it as an exercise to 
make the necessary modifications when solutions exist only on finite time 
intervals). The following three results are very useful in applications. 

Proposition 7.4.1 If x{t) is a solution of (7. 4-1), then so is x(t + r) for 
any r G K. 

Proof: By definition 

Hence, we have 

dx{t + t) dx{t) 

dt t=to 
or 



= /A(t + d)L=t„- (7.4.3) 

t = to 

Since (7.4.3) is true for any to G K, the result follows. □ 



dx{t + t) 
dt 



= f{x{to + t)) = f{x{t + r)) 1^^ 



t=to 



Note that Proposition 7.4.1 does not hold for nonautonomous vector 
fields. Consider the following example. 

Example 7.4.1. Consider the nonautonomous vector field 



X = e^, X G 


(7.4.4) 


The solution of (7.4.4) is given by 




x{t) = e*, 


(7.4.5) 


and it should be clear that 

x{t + t) = 

is not a solution of (7.4.4) for r yf 0. 


(7.4.6) 




End of Example 7.4.1 



The following proposition lies at the heart of the Poincare-Bendixson 
theorem. 
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Proposition 7.4.2 For any xq G K" there exists only one solution of 
(7.4-1) passing through this point. 

Proof: We will show that if this proposition weren’t true, then uniqueness 
of solutions would be violated. 

Let Xi{t), X2{t) be solutions of (7.4.1) satisfying 



Xi{ti) = Xo, 

X2{t2) = Xo- 



By Proposition 7.4.1 



X2{t) =X2{t- {ti - t2)) 

is also a solution of (7.4.1) satisfying 



^2(^1) = 2:0- 



Hence, by Theorem 7.1.1, xi{t) and X 2 {t) must be identical. □ 



Since for autonomous vector fields time-translated solutions remain so- 
lutions (i.e.. Proposition 7.4.1 holds), it suffices to choose a fixed initial 
time, say to = 0, which is understood and therefore often omitted from the 
notation (as we do now). 

Proposition 7.4.3 (Properties of a Flow) 

i) x{t, Xo) is C’’. 

ii) x(0,a:o) = xq. 

iii) x(t -I- s, a;o) = x(t, a;(s, Xq)) . 

Proof: i) follows from Theorem 7.1.1, ii) is by definition, and iii) follows 
from Proposition 7.4.2; namely, x{t,Xo) = x{t + s,Xo) and x(t,x{s,Xo)) 
are both solutions of (7.4.1) satisfying the same initial conditions at t = 0. 
Hence, by uniqueness, they must coincide. □ 

Proposition 7.4.3 shows that the solutions of (7.4.1) form a one-parameter 
family of C’’, r > 1, diffeomorphisms of the phase space (invertibility comes 
from iii)). This is referred to as a phase flow or just a flow. A common 
notation for flows is (f)(t,x) or 4>t{x). 

Let us comment a bit more on this notation 4>t{x). The part of Theorem 
7.1.1 dealing with differentiability of solutions with respect to xo (regarding 
t and to as fixed) allows us to think differently about the solutions of 
ordinary differential equations. More precisely, in the solution xftflojXo), 
we can think of t and to as fixed and then study how the map x{t,to,xo) 
moves sets of points around in phase space. This is the global, geometrical 
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view of the study of dynamical systems. For a set [/ C M", we would denote 
its image under this map by x{t, to, U). Since points in phase space are also 
labeled by the letter x, it is often less confusing to change the notation for 
the solutions, which is why we use the symbol (j). This point of view will 
become more apparent when we study the construction of Poincare maps. 

Finally, let us note that in the study of ordinary differential equations one 
might believe the problem to be finished when the “solution” x(t,to,xo) is 
found. The rest of the book will show that this is not the case, but, on the 
contrary, that this is when the story begins to get really interesting. 



7.5 Nonautonomous Vector Fields 



It should be clear that Propositions 7.4.1, 7.4.2, and 7.4.3 do not hold 
for nonautonomous vector fields. However, we can always make a nonau- 
tonomous vector field autonomous by redefining time as a new dependent 
variable. This is done as follows. 

By writing (7.0.1) as 

dt I ^ ^ 

and using the chain rule, we can introduce a new independent variable s 
so that (7.5.1) becomes 



dx , „ . , 

(7.5.2) 

as 

If we define y = (x,t) and g{y) = (/(x,t), 1), we see that (7.5.2) becomes 
y' = g{y), y G M” x (7.5.3) 



Of course, knowledge of the solutions of (7.5.3) implies knowledge of the 
solutions of (7.0.1) and vice versa. For example, if x{t) is a solution of 
(7.0.1) passing through xq t = to, i.e., x{to) = xo, then y(s) = (x(s + to), 
t(s) = s -I- to) is a solution of (7.5.3) passing through yo = (x(to),to) at 
s = 0. 

Every vector field can thus be viewed as an autonomous vector field. 
This apparently trivial trick is a great conceptual aid in the construction 
of Poincare maps for time-periodic and quasiperiodic vector fields, as we 
shall see in Chapter 10. Notice, however, that in redefining time as a de- 
pendent variable, it may then be introduced in various situations requiring 
specification of initial positions (i.e., specifying xo); in particular, the reader 
should reexamine the definition of stability given in Chapter 1. There are 
alternative, and more useful, views of nonautonomous vector fields that 
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we briefly describe since they are becoming more and more prominent in 
current developments in dynamical systems theory. 

Let x{t,to,xo), denote a solution of (7.0.1) with x{to,to,xo) = xq. Then 
the analog of property Hi) of Proposition 7.4.3 is the following: 



x{t 2 ,to,xo) = Vxo G R", and all to < ti < t 2 , 

(7.5.4) 

which is called the cocycle property. The notion of a cocycle is central 
to current developments in ergodic theory (Katok and Hasselblatt [1995]), 
random dynamical systems (Arnold [1998]), and nonautonomous dynamical 
systems (Kloeden and Schmalfuss [1997]). We will describe the cocycle 
formalism in more generality, but we first describe the skew-product flow 
approach to nonautonomous vector fields. 

7.5a The Skew-Product Flow Approach 

The skew-product flow approach is a way of dealing with nonautonomous 
vector fields so as to retain the flow properties as described in Proposition 
7.4.1. This approach has been pioneered by Sell [1967a, b], [1971]. 

The crucial observation leading to the development of the approach is the 
following. Suppose x{t) is a solution of i = f{x,t)- Then Xr{t) = x{t + r) 
is a solution of Xr{t) = fr{xr{t), t) = f{x{t + T),t + T) (compare this with 
the proof of Proposition 7.4.1). 

We first define the space of nonautonomous vector fields whose time 
translates remain within the space, i.e., 

T = {Space of functions / : K" x M — >■ R" such that 
/r(-,-) = /(•.• + 't) G Tier allr G R.} 

We then define the group of shift operators on T\ 

9r ■■ T 

f^6rf = fr, VtGR, (7.5.5) 



and we define the product space: 



X = W xT. 



A slight variation in notation is now useful. Let x(t,xo,f) denote a solu- 
tion of i = f{x,t) with x(0,xo,/) = Xq. Finally, we define the family of 
mappings: 



Tt : A ^ A, 

(Xoj) {x{t,Xo,f),etf) . 



(7.5.6) 
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'I't is a one-parameter family of mappings of X into X, or flow This 
follows easily from the fact that the identity (i.e., the analog of (7.5.4)): 

x{t + s, a:o, /) = x{t, x{s, xq, f),0sf), (7.5.7) 

is the same as 

'ft+s{xoJ) = ^to^>,{xoJ). (7.5.8) 

It is easy to see that the role of the shift operators 9r is to advance the 
explicit time argument in the vector field. The reader should imagine this 
in the context of attempting to prove Proposition 7.4.1 for nonautonomous 
vector fields. 

The notion of a skew-product flow derived from nonautonomous vector 
fields can be made general, and is referred to as a skew-product dynami- 
cal system (or just skew dynamical system). This is done as follows. Let 
M denote an n-dimensional manifold and S a metric space (or, possibly, 
something more general) . Consider a map of the form: 

f : M X S ^ M X S, 

(m,s) 1 -^ {(p{m,s),a{s)) . (7.5.9) 

Further, we will assume that for each fixed s, the map (j){-, s) : M — >■ M is 
a C"’ diffeomorphism. Then the system defined by (7.5.9) is referred to as 
a skew dynamical system over the base a : S ^ S. 

Skew dynamical systems play a central role in dynamical systems in their 
own right. The following topics and references are representative of these 
areas and applications. 

General Theory of Skew-Product Dynamical Systems: Sacker and 
Sell [1977], Sacker [1976], Shen and Yi [1998] develop many basic 
properties of skew-product dynamical systems. 

Invariant Manifold Theorems: Chow and Yi [1994], Yi [1993a, b] prove 
invariant manifold theorems for nonautonomous vector fields by using 
the skew-product dynamical systems approach. 

Global Stability: Meyer and Zhang [1996] develop many of the basic con- 
cepts of skew-product dynamical systems (e.g., stable and unstable 
manifolds of hyperbolic trajectories, hyperbolic sets, shadowing, basic 
sets, etc. ) for studying global dynamics in much the same manner as 
described in, e.g. Shub [1987], for standard differential dynamical dy- 
namical systems, i.e., dynamics defined by iterated diffeomorphisms 
or flows on manifolds. Johnson and Kloeden [2001] discuss attractors 
for skew-product dynamical systems. 



^It is important (but not for our descriptive purposes here) to have a topology 
on X such that the map (t,xo,f) x{t,xo,f) is continuous, see Sell [1967a, b] 
for details. 
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7.5b The Cocycle Approach 

The cocycle property can be expressed in a general formalism. Let P denote 
a parameter space. In different applications P could be a compact metric 
space, a function space, or a probability space. Let 0 = {6*t 1 1 C K} denote 
a one-parameter family of mappings of P into itself, i.e.. 



P i-A 9tp, 



with 

9t o Os = Ot+s, Vt, s G K, 

and 

6»o = id. 

Then we have the following definition. 

Definition 7.5.1 (Cocycle on M") A family of mappings 
4>t,p ■ K” R”, t G R, p G P, 

is called a cocycle on R” with respect to a group 0 of mappings on P if: 
P 4^0. p — idf 

4^t+s.p — f^tPsp ^ 4^s,p) 



for all t, s G R, p G P. 

The reader should be able to verify that (7.5.7) is an example of a cocycle. 
We show that the cocycle relation for nonautonomous vector fields given 
in (7.5.4) can be interpreted in terms of this general formalism. This can 
be seen as follows. Let P = R, Otto = to +t and to < to + s < to + s + t. 
Then we have: 



4^t+s.p — ^(^oT'^TfToj ^o) — ^0 ; ^0 ) ) — 4^t,dsp^4^s,p- 



7.5c Dynamics Generated by a Bi-Infinite 
Sequence of Maps 

There is yet another way to study the dynamics of nonautonomous vector 
fields. The dynamical systems point of view has taught us that it is often 
fruitful to study the trajectories of (7.0.1) by passing to an n-dimensional 
discrete time system, or map. We will explore this in some detail when we 
consider Poincare maps in Chapter 10. In this setting the dynamical aspects 
of geometrical structures in phase space often appear more simple than in 
the continuous time setting. One reason for this is that the trajectories 
of time-dependent velocity fields may intersect themselves. The resulting 
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trajectory can appear very complicated, but when “sampled” at discrete 
time its discrete time analog may appear more simple, and any underlying 
geometrical structure may be more apparent. 

There are a variety of ways of constructing a discrete time map from the 
trajectories of (7.0.1). However, the overriding issue is that the dynamics 
of the resulting discrete time system should somehow correlate with the 
dynamics of the continuous time system. The most straightforward way in 
which this connection can be made is if the trajectory of the velocity field 
interpolates the discrete set of points corresponding to the trajectory of the 
discrete time system. 

With this in mind, we define the following n-dimensional map from tra- 
jectories of (7.0.1) as follows: 

fn{xo) =x{to + nT, to + {n- 1)T, xq) , (7.5.10) 

where T > 0 is some fixed time increment and n € 2. We want to describe 
the evolution of Xq in the vector field (7.0.1) in terms of the map (7.5.10). 
Unfortunately, this cannot be done in the case when the velocity field has 
an arbitrary time-dependence since generally 

fj(xo) 7^ fk{xo) j yf k. 

Instead, we must use the bi-infinite sequence of maps 

{/«(•)}) nGZ, 

since the orbit of a point Xq in the vector field (7.0.1), i.e., 

{x G K" I X = x(t, to, xo), t G K} , 
interpolates the following bi-infinite sequence of points. 

{. . . , /_„ o /_„+! o • • • o /_i(xo), . . . , /-i(xo), Xo, /i(xo), 

■■ - ,fn° fn-l° ■■ ■ ° fl{xo),---} ■ 

We remark that in the usual situation studied in dynamical systems the- 
ory, if the velocity field is time periodic, with period T, then fj{-) = 
fk{'), Vj, fc G 2. Some basic results for dynamics generated by sequences 
of maps (such as a stable and unstable manifold theorem) can be found 
in Katok and Hasselblatt [1995]; see also de Blasi and Schinas [1973] and 
Irwin [1973]. 

For the most part in this book we will be considering autonomous vec- 
tor fields or maps constructed from nonautonomous vector fields (more 
specifically, maps constructed from time-periodic and quasiperiodic vector 
fields). Consequently, henceforth we will state definitions in the context of 
autonomous vector fields and maps. 
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7.6 Liouville’s Theorem 

Now we prove several results due to Liouville which describe the time evo- 
lution of volumes under a flow. 

Consider a general autonomous vector field 

X = f{x), X G M”, 

and suppose that it generates a flow Let Dq denote a domain in M” 
and let Dt = (j)t{Do) denote the evolution of Dq under the flow. Let V{t) 
denote the volume of Df. Then we have the following lemma. 

Lemma 7.6.1 



dt 



V • fdx, 



t=o 



I Do 



where V • / = -I- • • • -I- denotes the divergence of a vector field. 

Proof: From the definition of the Jacobian of a transformation we have 



V{t) = ( 

J Do 



det^dx. 

OX 



(7.6.1) 



By Taylor expanding the flow in t\ 

(f>t{x) =x + f{x)t + Oft^), 

it follows that 

where “id” denotes the n x n identity matrix. Using this relation, and a 
standard result from linear algebra on the expansion of the determinant 
(see, e.g., Arnold [1973], Sec. 16.3), we obtain 



det 






= 1 -I- tr 



dl 

dx 



t + 0{f). 



Substituting this expression into (7.6.1) gives 

V{t) = V{0)+ [ tv ■ fdx + Oit"^), 
Jdo 



from which the lemma follows immediately. 



□ 
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Now there is nothing distinguished about the point t = 0. In particular, 
Lemma 7.6.1 can be written in the form 



dt 



t = to 




V • fdy, 



for an arbitrary to 0- This can be seen as follows. Let 



(7.6.2) 



y = </>io(a;), 



for some arbirary to ^ 0. Then 




d(t)t{y) 

dy 



dy. 



Now 

My) = y + f{y)t + o{M, 



and 



dMy) 

dy 



■ , df 

= id+ -^t 
dy 






Using these three formulas we can repeat exactly the same steps as in the 
proof of Lemma 7.6.1 to conclude that (7.6.2) holds. 

We can use these results to derive equations for the time evolution of 
volumes which can be explicitly solved in certain cases. 

Suppose that the divergence is everywhere constant, i.e.. 



V • / = c = constant. 

Using (7.6.2), since to is arbitrary the evolution equation for the volume is 
given by 

V = cV, 

which has the obvious solution 



V{t) = e=*U(0). (7.6.3) 

In the case where the vector field is divergence free (i.e., c = 0), we have 
the following result that is typically referred to as Liouville ’s Theorem. 

Theorem 7.6.2 (Liouville) Suppose V • / = 0. Then for any region Dq, 

V{t) = U(0), 

where U(0) is the volume of Do and V{t) is the volume of 4>t (Do). 

Proof: This follows immediately from (7.6.3) by setting c = 0. □ 
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7.6a Volume Preserving Vector Fields and the 
Poincare Recurrence Theorem 

We refer to vector fields which have zero divergence as divergence free or 
volume preserving vector fields. 

Volume preserving dynamical systems, which we think of as flows gener- 
ated by divergence free vector fields or volume preserving maps, may have 
a certain recurrence property that is attributed to Poincare. We now prove 
this Poincare recurrence theorem. 

Theorem 7.6.3 (Poincare Recurrence Theorem) Suppose g : K" — 
K" is a continuous, one-to-one mapping, and suppose that D C K” is a 
compact invariant set, i.e., g{D) = D. Let x he any point in D and let 
U he any neighborhood of x. Then there exists a point x G U such that 
(?”(x) € U for some n > 0. 

Proof: Consider the sets defined by the images of U under iteration by g: 
U,g{U),g\U),...,g^iU),.... 

Since g is volume-preserving, each of these has the same volume. If they 
never intersected, then D would have infinite volume. But D is compact, 
therefore there must exist integers k > 0, I > 0, with k > I, such that 

g\U)ng\U)^ID. 

Therefore 

/■'([/) nc/y^0. 

If we let y = g^~\x), then x G U and g^{x) G U, where n = k — 1. □ 



7.7 Exercises 

1. Consider the stable and unstable manifolds of a hyperbolic fixed point of saddle-type 
of a C’' (r > 1) vector field. 

a) Can the stable (resp., unstable) manifold intersect itself? 

b) Can the stable (resp., unstable) manifold intersect the stable (resp., unstable) 

manifold of another fixed point? 

c) Can the stable manifold intersect the unstable manifold? If so, can the intersection 

consist of a discrete set of points? 

d) Can the stable (resp., unstable) manifold intersect a periodic orbit? 

These questions are independent of the dimension of the vector field (as long as it is 
finite); however, justify each of your answers with a geometrical argument for vector 
fields on IR^. {Hint: the key to this problem is uniqueness of solutions.) 

2. Consider the stable and unstable manifolds of a hyperbolic fixed point of saddle-type 
of a (r > 1) diffeomorphism. 

a) Can the stable (resp., unstable) manifold intersect itself? 
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b) Can the stable (resp., unstable) manifold intersect the stable (resp., unstable) 

manifold of another fixed point? 

c) Can the stable manifold intersect the unstable manifold? If so, can the intersection 

consist of a discrete set of points? 

These questions are independent of the dimension of the diffeomorphism (as long as 
it is finite); however, justify each of your answers with a geometrical argument for 
diffeomorphisms on Are the arguments the same as for vector fields? 

3. Consider the (r > 1) vector field 

X = f{x ) , X G R”” . 

Let </>t(rr) denote the flow generated by this vector field, which we assume exists for 
all t G R, a: G R^. Suppose that the vector field has a hyperbolic fixed point at 
X — X having an s-dimensional stable manifold, VL^(x), and a u-dimensional unstable 
manifold, W^{x) (s + u = n) . The typical way of proving their existence (see, e.g. , Palis 
and deMelo [1982] or Fenichel [1971]) is to prove the existence of the local manifolds 
VPjqc(S) and W^^{x) via a contraction mapping type of argument. Then the global 
manifolds are defined by 

w‘{x) = ij ;,($)), 

t<0 

= U 

t>o 

a) Show that W^{x) and W^{x) defined in this way are invariant for all t G R. 

b) If IFioc(^) ^loc(^) does it follow by this definition that W^{x) and 

W^{x) are C^? 

c) Discuss this definition of the stable and unstable manifolds in the context of how 

one might compute the manifolds numerically. 




FIGURE 7.7.1. 



4. Consider the situation described in the previous exercise in the context of diffeo- 
morphisms. Existence of stable and unstable manifolds of a hyperbolic fixed point is 
proved similarly (i.e., local manifolds are shown to exist via a contraction mapping 
argument), and the global manifolds are defined by 

W‘{x) = U 

n<0 

W'^ix) = U 

n>0 

where g denotes the diffeomorphism and x the hyperbolic fixed point. Answer a), b), 
and c) from the previous exercise in the context of C’’ diffeomorphisms. 

5. Consider a hyperbolic fixed point of a (r > 1) vector field on R^ whose stable and 
unstable manifolds intersect along a homoclinic orbit, as shown in Figure 7.7.1. Show 
that any point on the homoclinic orbit cannot reach the fixed point in finite time. 
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6. Consider a periodic orbit (of either a C’’ (r > 1) vector field or map) that is contained 
in a compact region of phase space. Can the period of the orbit be infinite? 

7. Consider the Lorenz equations: 

i 

y — px — y — xz, ( 7 , /3, p > 0. 
i = —f3z + xy, 

Describe the time evolution of volume elements under the flow generated by this vector 
field. 

8. Does the divergence free property of a vector field imply that the vector field has a 
first integral? 

9. Consider the vector field: 

X — —X + sin t. 

The solution through the point xq at to is given by: 

x{t; to, xq) — — (sin t — cos t) + TaiQ + — cos Iq — — sin to \ 



Construct a skew-product flow. 

10. Consider the vector field: 

X — —X t. 

The solution through the point xq at to is given by: 

x{t-, Xo, to) = t — 1 + (xo — to + . 

Construct a skew-product flow. 
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We now develop a technical apparatus to deal with the notions of “long 
term” and “observable” behavior of orbits of dynamical systems. We will 
be concerned with C” (r > 1) maps and autonomous vector fields on M” 
denoted as follows. 

Vector Field: x = f{x), x G K", (8.0.1) 

Map: X i-G g{x), x G M". (8.0.2) 

The flow generated by (8.0.1) (see Chapter 7) will be denoted as (j){t,x). 



8.1 The Asymptotic Behavior of Trajectories 

As we shall see in Chapter 9, the Poincare-Bendixson theorem characterizes 
the nature of the uj and a limit sets of flows on certain two manifolds. We 
now define uj and a limit sets. 

Definition 8.1.1 {oj and a Limit Points of Trajectories) A point 
xq G K" is called an ui limit point of x € R”, denoted oj{x), if there exists 
a sequence {ti}, U — > oo, such that 

(j){ti,x) — ;> a;o. 

a limit points are defined similarly by taking a sequence {ti}, ti — > — oo. 

Example 8.1.1. Consider a vector field on the plane with a hyperbolic saddle 
point, X, as shown in Figure 8.1.1. Then x is the u limit point of any point on 
the stable manifold and the a limit point of any point on the unstable manifold. 

End of Example 8.1.1 



Example 8.1.2. This example shows why it is necessary to take a subseqnence 
in time, {tf}, and not to simply let t b oo in the definition of the a and u> limit 
point. Consider a vector field on the plane with a globally attracting closed orbit, 
7, as shown in Figure 8.1.2. Then orbits not starting on 7 “wrap onto” 7. 

Now for each point on 7, we can find a subsequence {ti} such that <f>{ti,x), 
a; G approaches that point as i t 00. Therefore, 7 is the uj limit set of x as 
you would expect. However, lim fit, x) 7. 

t — >-00 



End of Example 8.1.2 
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FIGURE 8.1.1. CJ and a limit sets of the hyperbolic fixed point x. 




FIGURE 8.1.2. The point xo £ 7 is the oj limit point of x. 

Definition 8.1.2 (uj and a Limit Sets of a Flow) The set of all io 

limit points of a flow or map is called the uj limit set. The a limit set 
is similarly defined. 

We will need the idea of a and uj limit sets in the context of flows only, so 
we leave it to the reader to modify Definition 8.1.2 for maps as an exercise. 
The following result describes some basic properties of a and w limit sets 
of trajectories. 

Proposition 8.1.3 (Properties of uj Limit Points) Let he a flow 
generated by a vector field and let M. he a positively invariant compact set 
for this flow. Then, for p G A4, we have 

i) uj{p) yf 0; 

ii) ui{p) is closed; 

iii) io{p) is invariant under the flow, i.e., to{p) is a union of orbits; 
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iv) uj{p) is connected. 

Proof: i) Choose a sequence {U}, lim ti = oo, and let {pi = </>(. (p)}. Since 

M is compact, {pi} has a convergent subsequence whose limit belongs to 
co(p). Thus, uj(p) ^ 0. 

ii) It suffices to show that the complement of u{p) is open. Choose q ^ 
Lo{p). Then there must exist some neighborhood of q, U{q), that is disjoint 
from the set of points {(ftip) \ t T } for some T > 0. Hence, q is contained 
in some open set that contains no points in u>{p). Since q is arbitrary, we 
are done. 

iii) Let q € io(p) and q = 4>s{q)- Choose a sequence ti — > oo as z f oo 
with 4>ti{p) — >■ q- Then cj)ti+s{p) = 4>s{4’ti{p)) (cf. the notation for flows 
following Proposition 7.4.3) converges to g as z — >■ oo. Hence, q € oj(p), and 
therefore w(p) is invariant. However, there is a slight hole in this argument 
that needs to be filled; namely, it is not immediately obvious that ifs(-) 
exists for all s. 

We begin by arguing that f>s(q) exists for s G (— 00 , 00 ) when q G u>(p). 
It should be clear that this is true for s G (0, 00 ) since A4 is a positively 
invariant compact set (cf. Theorem 1.1.9). Therefore, it suffices to show 
that this is true for s G (— oo,0]. 

Now q G u>(p), so by definition we can find a sequence {U}, ti — > 00 as 
z t 00 , such that 4>ti{p) — >■ as z — >■ 00 . Let us order the sequence so that 

ti < t 2 < ■ ■ ■ < tn < ■ ■ ■ ■ Next consider 4>s{4>ti{p))- By Proposition 7.4.3 
this is valid for s G [— ti, 0]. Taking the limit as z — >■ 00 and using continuity 
as well as the fact that 0*^ — >■ g as z — >■ 00 , we see that 4>s{q) exists for 
s G (— 00 , 0]. 

iv) The proof is by contradiction. Suppose to{p) is not connected. Then 

we can choose open sets Vi, V 2 such that uj{p) C Pi U V 2 , oj{p) (iVi ^ 
Lo{p) n P 2 0, and Vi (1 V 2 = 0. The orbit of p accumulates on points 
in both Vi and V 2 ; hence, for any given T > 0, there exists t > T such 
that (j)t{p) G M — (Pl U P 2 ) = K. Therefore we can find a sequence {tn}, 
tn — >-oo as zz t 00 , with (ftnip) G Passing to a subsequence, if necessary 
{K is compact), we have 4>t„{p) -G q, q G K . But this implies that q G 
Pi U P 2 . However, our construction indicates that q is also in uj{p). This is 
a contradiction. □ 

One can prove a similar result for a. limit sets provided the hypotheses 
of the proposition are satisfied for the time reversed flow. 

For maps, the notion of a nonwandering point has been more fashionable; 
however, we will explore the relationship between these two concepts in the 
exercises. 

Definition 8.1.4 (Nonwandering Points) A point xq is called nonwan- 
dering if the following holds. 
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Flows: For any neighborhood U of xq and T > 0, there exists some |t| > T 
such that 

4){t,U)f^U 

Maps: For any neighborhood U of xq, there exists some n ^ 0 such that 

g”{U)(lU^ID. 

Note that if the map is noninvertible, then we must take n > 0. 

Fixed points and periodic orbits are nonwandering. 

Definition 8.1.5 (Nonwandering Set) The set of all nonwandering 
points of a map or flow is called the nonwandering set of that particular 
map or flow. 



8.2 Attracting Sets, Attractors, and Basins of 
Attraction 

Definitions 8.1.2 and 8.1.4 do not address the question of stability of those 
asymptotic motions. For this we want to develop the idea of an attractor. 

Definition 8.2.1 (Attracting Set) A closed invariant set A C K" is 
called an attracting set if there is some neighborhood U of A such that: 

flows: Vt > 0, (j>{t, U) C U and Hoo 

maps: Vn > 0, g"‘{U)cU and g'^{U) = A. 

Definition 8.2.2 (Trapping Region) The open set U in Definition 
8.2.1 is often referred to as a trapping region. 

A similar definition can be given for maps. By now the necessary modi- 
fications should be obvious, and we leave the details as an exercise for the 
reader. 

It should be evident to the reader that finding a Liapunov function is 
equivalent to finding a trapping region (cf. Chapter 2) . Also, let us mention 
a technical point; by Theorem 7.1.1 it follows that all solutions starting in 
a trapping region exist for all positive times. This is useful in noncompact 
phase spaces such as for proving existence on semi-infinite time intervals. 

In the continuous time case, one “tests” whether or not a region is a 
candidate to be a trapping region by evaluating the vector field on the 
boundary of the region in question. If, on the boundary of the region, the 
vector field is pointing toward the interior of the region, or is tangent to 
the boundary, then the region in question is a trapping region. However, 
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X 



FIGURE 8.2.1. Basins of attractions of the sinks. 

in order that this test can be carried out, one needs that the boundary of 
the region must be (at least) C^. 

There is another idea related to trapping regions that is becoming more 
commonly used in some areas. This is the notion of an absorbing set, which 
we now define. 

Definition 8.2.3 (Absorbing set) A positive invariant compact subset 
B C M" is called an absorbing set if there exists a bounded subset o/M", 
U , with U D B, and: 

flows: tu > 0 such that 4>{t, U) C B, 'it>tu. 

maps: njj > 0 such that g^{U) C B, \/n > nu- 

If we have an attracting set it is natural to ask which points in phase 
space approach the attracting set asymptotically. 

Definition 8.2.4 (Basin of Attraction) The domain or basin of attrac- 
tion of an attracting set A is given by 

flows: [J(l){t,U), 

t<0 

maps: |J g"(C/), 

n<0 

where U is any open set satisfying Definition 8.2.1. 

We remark that the basin of attraction is independent of the choice of 
the open set U, provided that U satisfies Definition 8.2.1. 

Note that even if g is noninvertible, g~^ still makes sense in a set theoretic 
sense. Namely, g~^{U) is the set of points in K” that maps into U under 
g; g~^, n > 1 is then defined inductively. 
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Example 8.2.1 (Application to the Unforced Duffing Oscillator). 

As we’ve seen, the unforced Duffing oscillator has, for 5 > 0, two attractors 
which are fixed points. The boundaries of the domains of attraction for the two 
attractors are defined by the stable manifold of the saddle of the origin (see Figure 
8.2.1). 



End of Example 8.2.1 

Now we want to motivate the idea of an attractor as opposed to attracting 
set. We do this with the following example taken from Guckenheimer and 
Holmes [1983]. 




FIGURE 8.2.2. Attracting set of Example 1.8.3. 



Example 8.2.2. Consider the planar autonomous vector field 

3 

X = X — X , 

i) = -y, (x,y) e X 

This vector field has a saddle at (0,0) and two sinks at (±1,0). The j/-axis is 
the stable manifold of (0,0). We choose an ellipse, Ai, containing the three fixed 
points as shown in Figure 8.2.2. 

It should be clear that At is a trapping region and that the closed interval 
[— 1, 1] = rit>o attracting set. 



End of Example 8.2.2 

Example 8.2.2 points out what some might regard as a possible deficiency 
in our Definition 8.2.1 of an attracting set. In this example, almost all 
points in the plane will eventually end up near one of the sinks. Hence, 
the attracting set, the interval [—1,1], contains two attractors, the sinks 
(±1,0). Therefore, if we are interested in describing where most points 
in phase space ultimately go, the idea of an attracting set is not quite 
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precise enough. Somehow we want to incorporate into the definition of an 
attracting set the notion that it is not a collection of distinct attractors, 
but rather that all points in the attracting set eventually come arbitrarily 
close to every other point in the attracting set under the evolution of the 
flow or map. We now want to make this mathematically precise. 

Definition 8.2.5 (Topological Transitivity) A closed invariant set A 
is said to be topologically transitive if, for any two open sets U, V C A 

flows: 3 f G K 9 (j){t, U)C\V yf 0, 

maps: 3 n G Z 9 g"(C/) fl yf 0. 

Definition 8.2.6 (Attractor) An attractor is a topologically transitive 
attracting set. 

We remark that the study of attractors and their basin boundaries in dy- 
namical systems is rapidly evolving and, consequently, the theory is incom- 
plete. For more information see Conley [1978], Guckenheimer and Holmes 
[1983], Milnor [1985], and Ruelle [1981]. 

8.3 The LaSalle Invariance Principle 

Here we describe an application of the invariance of uj limit sets of a trajec- 
tory that is very useful for stability issues. It is referred to as the LaSalle 
invariance principle (LaSalle [1968]). 

We first develop the set-up. Let 

X = f{x), X G M", 

denote a C"’, r > 1 vector field. Let Ai C M" be a positively invariant 
compact set under the fiow, generated by this vector field, which is 

the closure of some open set (so that it has nonempty interior) and whose 
boundary is (at least) . Therefore Ad is a trapping region. Let V{x) be 
a Liapunov function on Ai. By this we mean that V < 0 on Ad. Note 
that we are using the term Liapunov function differently than was used 
in Chapter 2. There Liapunov function was a local notion defined in the 
neighborhood of an equilibrium point. Now we will consider a more global 
notion. Consider the following two sets. 

L; = {x G Ad I L(cc) = 0 } , 

^ { The union of all trajectories that start in E 

~ ( and remain in E for allt > 0. 

M is the “positively invariant part” of E. Now we can state the LaSalle 
invariance principle. 
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Theorem 8.3.1 (LaSalle, 1968) For all x G M, <pt{x) — >■ M as t ^ oo. 

Proof: First we argue that V = \ = constant on u^x) (see also Theorem 
15.0.3). This can be seen as follows. Suppose x G oj{x) and let x = V{x), 
then X is the greatest lower bound of the set {V{4>t{x)) 1 1 > 0}. This follows 
from the fact that V{x) decreases along trajectories (hence V{(j)ti{x)) > 
V{4>t{x)) > V{<j)ti+i{x))forti < t < ti+i) and by the continuity of V(x). 
From Proposition 8.1.3 the omega limit set of a trajectory is invariant, 
hence </>t(x) is also an omega limit point of <pt(x). Then, since x is the 
greatest lower bound of the set {V{<j)t{x)) 1 1 > 0}, V{(j)t{x)) = X- 

From this it follows that P = 0 on uj{x). Then, by the definition of E, 
Lo{x) C E. Since u){x) is invariant (Proposition 8.1.3), it follows by the 
definition of M that uj{x) C M. Therefore, (j>tix) — >■ M as t — >■ oo. □ 

This result is particularly useful when one has information about M . For 
example, consider the Duffing equation 

x = y, 

y = X — x^ — Sy, (x, y) G <5 > 0. 

Now consider the function 




It is easy to verify that 

V = -Sy^. 

Now consider the level set V = c for c very large, but finite. This curve 
defines the boundary of a positively invariant compact set. E is given by the 
intersection of the x axis with this set and M is the three equilbrium points 
on the X axis. The LaSalle invariance principle states that all trajectories 
converge to one of the equilibrium points. 

This example shows the relation between the Liapunov function V and 
the trapping region j\4 that often arises in specific applications. One first 
constructs a candidate for a Liapunov function. Then certain level sets of 
this function may be used to define j\4. 

We remark that the LaSalle invariance principle can be generalized con- 
siderably. In particular, versions exist for situations where we have non- 
uniqueness of trajectories, finite time blow up of trajectories, and for dis- 
crete time dynamical systems, see LaSalle [1968] for details and references. 



8.4 Attraction in Nonautonomous Systems 

The notions of attracting set, basin of attraction, and attractor developed 
in this chapter may not be adequate to describe the same phenomena in 




112 



8. Asymptotic Behavior 



nonautonomous systems. ^ In the nonautonomous setting attracting sets 
and basins of attraction may vary in time and the attraction rates my 
be nonuniform. The paper of Kloeden and Schmalfuss [1997] is tutorial in 
nature and provides an excellent introduction to the main issues. See also 
Kloeden and Stonier [1998], Knyazhishche and Shavel [1995], Liu [1993], 
and Yoshizawa [1985]. 

We discuss some aspects of attraction in nonautonomous systems follow- 
ing Kloeden and Schmalfuss [1997] and Griine and Kloeden [2001]. 

First, for nonautonomous systems a different notion of convergence of 
solutions to a specific solution arises. We begin with some motivation for 
this. Consider the following one-dimensional, linear nonautonomous vector 
field: 

x = —x + g{t), a: G M. (8.4.1) 

Using the variation of constants formula, it is easily computed that the 
trajectory through the point Xq at t = to is given by: 

a;(t, to, a^o) = f e^g{s)ds, (8.4.2) 

Jto 

where the function g{s) needs to be defined in such a way that the integral 
makes sense, say continuous with no more than polynomial growth in s as 
s — >■ ±oo. 

It is a simple matter to conclude by examining (8.4.2) that all trajectories 
of (8.4.1) converge to the trajectory 

</>(t) = e~* f e^g{s)ds, (8.4.3) 

J — OO 

as t — >■ oo (the reader should verify that (8.4.3) is indeed a trajectory of 
(8.4.1)). This is called forward convergence and is mathematically expressed 
as: 

lim \x{t,to,xo) — \ = 0, to, xo fixed. (8.4.4) 

t—¥00 

Hence, forward convergence is a property that is verified in the limit as 
t — >■ OO. However, suppose we were interested in a different, but related 
question. Namely, for the trajectory 4>(t), and a fixed, finite time t, char- 
acterize convergence of trajectories to </>(t) at the time t. This is a more 
practical question from the point of view of applications, as well as numer- 
ical analysis, and give rise to the notion of pullback convergence, which is 
mathematically expressed as follows: 



lim |x(t, to,2^o) ~ </'(^)| = 0, t, a;o fixed. (8.4.5) 

tQ—^ — CO 



^Of course, if the vector field is time periodic then one can reduce the problem 
to the study of a Poincare map (see Chapter 10) and then the results for maps 
apply immediately. 
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Notice that (8.4.3) is the only trajectory which all trajectories of (8.4.1) 
converge to as to c» (with t fixed). However, there are many trajectories 
to which all trajectories of (8.4.1) converge to as t — >■ oo (with to fixed). 
For example, all trajectories of (8.4.1) converge to 

0(t) = [ e''g{s)ds, 

J tQ 



for any fixed to- 

For nonautonomous systems pullback convergence and forward conver- 
gence are independent notions (see Exercises 16 and 17). However, they are 
equivalent for autonomous systems. Essentially this follows from Proposi- 
tion 7.4.1 which states that solutions of autonomous systems depend only 
on the “elapsed time”, t — tp. Hence, letting t — >■ oo, with to fixed, is equiv- 
alent to letting to — >■ — oo, with t fixed. 

The history of pullback convergence is discussed briefly in Griine and 
Kloeden [2001] who give much credit to the development of the idea to 
Krasnosel’skii [2001]. 

The notion of pullback convergence seems natural for defining attractors 
in nonautonomous systems, as we now describe. First, we establish some 
general notation. We begin by defining two notions of the distance between 
compact subsets of K”. 

Definition 8.4.1 (Hausdorff Separation) Let A and B be two non- 
empty, eompact subsets o/K". The Hausdorff separation of A and B, de- 
noted H*{A,B), is defined as: 

H* (A, B) = max dist (a, B) = max min [a — bj. (8.4.6) 

^ ' aeA ^ ’ aSA bSB ' ' ^ ^ 



Now we can define the Hausdorff metric. 

Definition 8.4.2 (Hausdorff Metric) 

H{A, B) = max {H*{A, B),H*{B, A)) . (8.4.7) 



Now we can address the issue of attraction in the pullback sense using 
the language of cocycles developed in Chapter 7. Let <pt,p, t G M., p £ P 
denote a cocycle on R” with respect to the group 0 of mappings on P (see 
Definition 7.5.1). 

Definition 8.4.3 (Pullback Attracting Set) A family A = {Ap, p £ P} 
of eompact subsets of M” is called a pullback attracting set of a cocycle 
{4>t,p, t £ M.,p £ P} on R” if it is invariant in the sense that: 



4^t,p ^Ap^ — Ag^p, t £ R, p £ P, 
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and pullback attracting in the sense that for every Ap there exists a bounded 
subset o/K", Dp D Ap, such that 

lim H* {(l>t,e.-tp{Dg__,p),Ap) = 0. 



The existence of a pullback attracting set is established through the 
notion of a pullback absorbing family, which we now define (compare with 
the definition of absorbing set given in Definition 8.2.3). 

Definition 8.4.4 (Pullback Absorbing Family) A family 

B = {Bp, p€ P}, 

of compact subsets o/K” is called a pullback absorbing family for a cocycle 
{4>t,p, t € G P} on M" if for each p G P there is a bounded subset of 
M", Up, satisfying Up D Bp, and a time tjjp > 0 such that: 

—tpiUg —tp) U- ^pi I — lup- 

Now we can state the main theorem on the existence of pullback attract- 
ing sets. 

Theorem 8.4.5 Let {4>t,pi t G G P{ be a cocycle of continuous map- 
pings on K" with a pullback absorbing family B = {Bp, p G P}. Then there 
exists a pullback attracting set A = {Ap, p G P{ with components uniquely 
determined by: 

T>0 t~>T 



Various versions of this theorem are discussed in Griine and Kloeden 
[2001], where the notion of Lyapunov functions for pullback attracting sets 
is also discussed. 



8.5 Exercises 

1. Let denote a flow generated by a (r > 1) vector field on that exists for 

all a: G , i G R. 

1) Show that the a and co limit sets of the flow are contained in the nonwandering 
set of the flow. 

2) Is the nonwandering set contained in the union of the a and cj limit set? 

2. Prove that the basin of attraction is independent of the choice of the open set U, 
provided that U satisfies Definition 8.2.1. 

3. Suppose A is an attracting set (of either a vector field or map), and suppose that x ^ A 
is a hyperbolic fixed point of saddle-type. Must the following be true 
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1) W‘(x) C A, 

2) W“(£) C A? 

4. Consider the union of the homoclinic orbit and the hyperbolic fixed point that it 
connects (shown in Figure 7.7.1). Can this set be an attracting set? 

5. Consider the (r > 1) diffeomorphism 

X I— g(x) , X G . 

Suppose that det Dg{x) — ly x G Prove that the diffeomorphism preserves volume. 

6 . Consider the following vector field on 



( 



xi 

^2 



) 





A > 0. 



The stable manifold of the origin is given by 

VF^(0, 0) — |(a:i, 3 : 2 ) G \x 2 — o}- 



Consider a line segment contained in VF^(0,0). Under the evolution of the flow gener- 
ated by the vector field the length of the line segment shrinks to zero as f ^ 00 . Does 
this violate the result of Exercise 5? Why or why not? 

7. Consider the (r > 1) diffeomorphism 

X I— g{x ) , X G . 

Suppose x = X is a nonwandering point, i.e., for any neighborhood U of there exists 
an n 7 ^ 0 such that g^{U) Pi U 7 ^ 0 (cf. Definition 8.1.4). Is it possible that there may 
exist only one such n, or, if there exists one n, must there be a countable (infinity?) 
of such n? Does the same result hold for flows? 



8 . Let B be an absorbing set. Show that: 



t>0 



is an attracting set. 

9. Describe the relation between trapping regions and absorbing sets. 

10. Apply the LaSalle Invariance principle to the vector field 

3 

X — X — X , 

V = -y, (x, v) G R^. 

and conclude that all trajectories converge to one of the equilibrium points. 

11. Consider the simple harmonic oscillator 

X = y, 

y = —X, (a:, y) G 

Using 

V(x,y) = i , 

as a Liapunov function, what does the LaSalle invariance principle say about this 
system? 
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12. Suppose that A4 is a positively invariant compact set for a flow which is the 

closure of some open set and has a boundary, and that V is a strict Liapunov 
function on A4 — {equilibrium points}. Then show that all trajectories converge to an 
equilibrium point. 

13. With M and A4 defined as in the discussion of the LaSalle invariance principle, if V 
is a Liapunov function on show that M is an attracting set and the interior of 

is in the basin of attraction. 

14. Suppose x{t, to, aio) is a trajectory of an autonomous vector field in that exists for 
all time. Consider the set 

7“ — {a: G \ x — xq), t G R} . 

Prove that 1~ is an invariant set. 

15. Suppose x{t, tQ, a^o) i® a trajectory of an nonautonomous vector field in R^ that exists 
for all time. Consider the set 

7“ — {a: G R^ \ x — x{t,to, xq), t G R} . 

Argue that 1~ is not generally an invariant set. 

16. Consider the vector field: 

X — —2tx, ai G R. 

Show that all trajectories are forward, but not pullback, convergent to a: = 0. 

17. Consider the vector field: 

X — 2tx, a: G R. 

Show that all trajectories are pullback, but not forward, convergent to a: = 0. 

18. Reformulate Definition 8.4.3 in terms of the trajectories of nonautonomous vector 
fields. 

19. Reformulate Definition 8.4.4 in terms of the trajectories of nonautonomous vector 
fields. 

20. Reformulate Theorem 8.4.5 in terms of the trajectories of nonautonomous vector fields. 

21. Consider the vector field: 

X = —X + sin t. 

The solution through the point xq at to is given by: 

x(t; to, Xq) — — (sin t — cos t) + ^a:o + — cos to — — sin to 

Construct the pullback attracting set. 

22. Consider the vector field: 

X — —x + t. 

The solution through the point xq at to is given by: 



(t-to) 



a:(t; a^o, to) — t — 1 + (xo — to + l)e 



Construct the pullback attracting set. 



23. Prove Theorem 8.4.5. 
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The Poincare-Bendixson 
Theorem 



The Poincare-Bendixson theorem gives us a complete determination of the 
asymptotic behavior of a large class of flows on the plane, cylinder, and two- 
sphere. It is remarkable in that it assumes no detailed information about 
the vector field, only uniqueness of solutions, properties of w limit sets, and 
some properties of the geometry of the underlying phase space. We begin 
by setting the framework and giving some preliminary definitions. 

We will consider C’’, r > 1, vector fields 

i = f{x,y), 

y = g{x,y), {x,y)G'P, 

where V denotes the phase space, which may be the plane, cylinder, or 
two-sphere. We denote the flow generated by this vector field by 

M-), 

where the in this notation denotes a point (x, y) G V. 

The following definition will be useful. 




FIGURE 9.0.1. 
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9. The Poincare-Bendixson Theorem 



Definition 9.0.1 Let be a continuous, connected arc in V. Then S is 
said to be transverse to the vector field on V if the vector dot product of 
the unit normal at each point on S with the vector field at that point is not 
zero and does not change sign on S. Or equivalently, since the vector field 
is C’’, r > 1, the vector field has no fixed points on S and is never tangent 
to S. 

Now we are in a position to actually prove the Poincare-Bendixson theo- 
rem. We will first prove several lemmas from which the theorem will follow 
easily. Our presentation follows closely Palis and de Melo [1982]. In all that 
follows, M is understood to be a positively invariant compact set in V. 
For any point p G V, we will denote the orbit of p under the flow for 
positive times 0+(_p) (also called the positive semiorbit of p). 

Lemma 9.0.2 Let H C A4 be an arc transverse to the vector field. The 
positive orbit through any point p G M., 0+{p), intersects T, in a monotone 
sequence; that is, if Pi is the intersection of 0^{p) with S, then Pi G 
[Pi-l,Pz+l]- 

Proof: Consider the piece of the orbit 0+(p) from Pi-i to Pi along with 
the segment [pi-i,pi] C S (see Figure 9.0.1). (Note: of course, if 0+(p) 
intersects S only once then we are done.) 

This forms the boundary of a positively invariant region T>. Hence, 
0+{pi) C V, and therefore we must have pi+i (if it exists) contained in 
T). Thus we have shown that Pi G [pi_i,pi+ij. □ 

We remark that Lemma 9.0.2 does not apply immediately to toroidal 
phase spaces. This is because the piece of the orbit from pi_i to pi along 
with the segment [pi-i,pi] C S needs to divide A4 into two “disjoint 
pieces.” This would not be true for orbits completely encircling a torus. 
However, the lemma would apply to pieces of the torus that behave as Ai 
described above. 

Corollary 9.0.3 The co-limit set of p (co{p)) intersects S in at most one 
point. 

Proof: The proof is by contradiction. Suppose co{p) intersects S in two 
points, qi and q 2 . Then by the definition of w-limit sets, we can And se- 
quences of points along 0+{p), {pn} and {p„}, which intersect S such that 
Pn — >qi as n t oo and Pn — >q 2 as n f oo. However, if this were true, then it 
would contradict the previous lemma on monotonicity of the intersections 
of 0+(p) with S. □ 

Lemma 9.0.4 Lf ui{p) does not contain fixed points, then to{p) is a closed 
orbit. 
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Proof: The strategy is to choose a point q G oj{p), show that the orbit of q 
is closed, and then show that uj{p) is the same as the orbit of q. 

Choose X € to(q); then x is not a fixed point, since to{p) closed and is 
a union of orbits containing no fixed points. Construct an arc transverse 
to the vector field at x (call it S). Now 0+{q) intersects S in a monotone 
sequence, {(?«}, with qn^x as n f oo, but since qn € to(p), by the previous 
corollary we must have qn = x for all n. Since x € u>(q), the orbit of q must 
be a closed orbit. 

It only remains to show that the orbit of q and co(p) are the same thing. 
Taking a transverse arc, S, at q, we see by the previous corollary that w(p) 
intersects S only at q. Since w(p) is a union of orbits, contains no fixed 
points, and is connected, we know that 0(q) = oj(p). □ 

Lemma 9.0.5 Let pi and p 2 be distinet fixed points of the vector field 
contained in ui{p), p G A4. Then there exists at most one orbit 7 C to{p) 
such that a{'j) = pi and uj(j) = P 2 - (Note: by 0(7) we mean the a limit 
set of every point on 7; similarly for cofj)-) 




FIGURE 9.0.2. 

Proof: The proof is by contradiction. Suppose there exist two orbits 71, 
72 G co{p) such that a{ji) = pi, w(7i) = P2, f = 1, 2. Choose points qi G 71 
and q 2 G 72 and construct arcs Si, S2 transverse to the vector field at each 
of these points (see Figure 9.0.2). 

Since 71, 72 C to{p), 0+{p) intersects Si in a point a and later intersects 
S 2 in a point b. Hence, the region bounded by the orbit segments and arcs 
connecting the points qi, a, b, < 72 , P 2 (shown in Figure 9.0.2) is a positively 
invariant region, but this leads to a contradiction, since 71, 72 C to{p). □ 



Now we can finally prove the theorem. 
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FIGURE 9.0.3. a) 0 < <5 < ^ 8 ; b) 5 > 

Theorem 9.0.6 (Poincare-Bendixson) Let Ai be a positively invariant 
region for the vector field containing a finite number of fixed points. Let 
p € Ai, and consider to{p). Then one of the following possibilities holds. 

i) ui{p) is a fixed point; 

ii) oj{p) is a closed orbit; 

iii) uj{p) consists of a finite number of fixed points pi, • • • and orbits 
7 with a{-f) = Pi and ^( 7 ) = pj. 

Proof: If uj{p) contains only fixed points, then it must consist of a unique 
fixed point, since the number of fixed points in Ai is finite and uj{p) is a 
connected set. 

If io{p) contains no fixed points, then, by Lemma 9.0.4, it must be a 
closed orbit. Suppose that to{p) contains fixed points and nonfixed points 
(sometimes called regular points). Let 7 be a trajectory in co^p) consisting 
of regular points. Then tu{'y) and 0 ( 7 ) must be fixed points since, if they 
were not, then, by Lemma 9.0.4, 0 ^( 7 ) and 0 ( 7 ) would be closed orbits, 
which is absurd, since uj{p) is connected and contains fixed points. 

We have thus shown that every regular point in uj{p) has a fixed point 
for an a and w limit set. This proves iii) and completes the proof of the 
Poincare-Bendixson theorem. □ 



For an example illustrating the necessity of a finite number of fixed 



9.1 Exercises 



121 



points in the hypotheses of Theorem 9.0.6 see Palis and de Melo [1982]. 
For generalizations of the Poincare-Bendixson theorem to arbitrary closed 
two-manifolds see Schwartz [1963]. 

Example 9.0.1 (Application to the Unforced Duffing Oscillator). 

We now want to apply the Poincare-Bendixson theorem to the unforced Duffing 
oscillator which, we recall, is given by 

x = y, 

y = X — — 6y, 5 > 0. 

Using the fact that the level sets olV{x,y) = y^/2 — x^/2-(-a;^/4 bound positively 
invariant sets for 5 > 0, we see that the unstable manifold of the saddle must fall 
into the sinks as shown in Figure 9.0.3. The reader should convince him- or herself 
that Figure 9.0.3 is rigorously justified based on analytical techniques developed 
in this chapter. Note that we have not proved anything about the global behavior 
of the stable manifold of the saddle. Qualitatively, it behaves as in Figure 9.0.4, 
but, we stress, this has not been rigorously justified. 



End of Example 9.0.1 



9.1 Exercises 

1. Use the Poincare-Bendixson theorem to show that the vector field 

X — fix — y — x\x + y j, 
y = X + p.y - y{x^ + y^), (x,y)eK^, 

has a closed orbit for y, > 0. {Hint: transform to polar coordinates.) 

2. Prove that for <5 > 0 the unstable manifold of the saddle- type fixed point of the 
unforced Duffing oscillator falls into the sinks as shown in Figure 9.0.3. 




FIGURE 9.0.4. 
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Poincare Maps 



The idea of reducing the study of continuous time systems (flows) to the 
study of an associated discrete time system (map) is due to Poincare [1899], 
who first utilized it in his studies of the three body problem in celestial 
mechanics. Nowadays virtually any discrete time system that is associated 
with an ordinary differential equation is referred to as a Poincare map. 
This technique offers several advantages in the study of ordinary differential 
equations, including the following: 

1. Dimensional Reduction. Construction of the Poincare map involves 
the elimination of at least one of the variables of the problem resulting 
in the study of a lower dimensional problem. 

2. Global Dynamics. In lower dimensional problems (say, dimension < 4) 
numerically computed Poincare maps provide an insightful and strik- 
ing display of the global dynamics of a system; see Guckenheimer and 
Holmes [1983] and Lichtenberg and Lieberman [1982] for examples of 
numerically computed Poincare maps. 

3. Conceptual Clarity. Many concepts that are somewhat cumbersome 
to state for ordinary differential equations may often be succinctly 
stated for the associated Poincare map. An example would be the 
notion of orbital stability of a periodic orbit of an ordinary differential 
equation (see Hale [1980]). In terms of the Poincare map, this problem 
would reduce to the problem of the stability of a fixed point of the 
map, which is simply characterized in terms of the eigenvalues of the 
map linearized about the fixed point. 

It would be useful to give methods for constructing the Poincare map as- 
sociated with an ordinary differential equation. Unfortunately, there exist 
no general methods applicable to arbitrary ordinary differential equations, 
since construction of the Poincare map requires some knowledge of the 
geometrical structure of the phase space of the ordinary differential equa- 
tion. Thus, construction of a Poincare map requires ingenuity specific to 
the problem at hand; however, in four cases that come up frequently, the 
construction of a specific type of Poincare map can in some sense be said 
to be canonical. The four cases are: 

1. In the study of the orbit structure near a periodic orbit of an ordinary 
differential equation. 
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2. In the case where the phase space of an ordinary differential equation 
is periodic, such as in periodically forced oscillators. 

3. In the study of the orbit structure near a homoclinic or heteroclinic 
orbit. 

4. In the study of two degree-of- freedom Hamiltonian systems. 

We begin by considering Case 1. 



10.1 Case 1: Poincare Map Near a Periodic Orbit 

Consider the following ordinary differential equation 

X = f{x), X G M", (10.1.1) 




FIGURE 10.1.1. The geometry of the Poincare map for a periodic orbit. 

where /:[/—>■ K" is C’’ on some open set U C M". Let denote the 

flow generated by (10.1.1). Suppose that (10.1.1) has a periodic solution of 
period T which we denote by (j)(t,xo), where xq G K" is any point through 
which this periodic solution passes (i.e., (j){t + T,xq) = (j){t,Xo))- Let S 
be an n — 1 dimensional surface transverse to the vector field at a;o (note: 
“transverse” means that f{x) ■ n(x) yf 0 where denotes the vector dot 
product and n{x) is the normal to S at a;); we refer to S as a cross-section 
to the vector field (10.1.1). Now in Theorem 7.1.1 we proved that 4>{t,x) 
is C” if f{x) is C”; thus, we can find an open set V C T, such that the 
trajectories starting in V return to S in a time close to T. The map that 
associates points in V with their points of first return to S is called the 
Poincare map, which we denote by P. To be more precise, 

P: U ^ S, 

X !—>■ ())(t(x), x) , 



( 10 . 1 . 2 ) 
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where t(x) is the time of first return of the point x to S. Note that, by 
construction, we have t(xo) = T and -P(xq) = xq- 

Therefore, a fixed point of P corresponds to a periodic orbit of (10.1.1), 
and a period k point of P (i.e., a point x € V such that P^{x) = x provided 
P'^(x) € V, i = I,--- ,k) corresponds to a periodic orbit of (10.1.1) that 
pierces S k times before closing; see Figure 10.1.1. 

In applying this technique to specific examples, the following questions 
immediately arise. 

1. How is S chosen? 



2. How does P change as S is changed? 



Question 1 cannot be answered in a general way, since in any given prob- 
lem there will be many possible choices of S. This fact makes the answer 
to Question 2 even more important. However, for now we will postpone 
answering this question in order to consider a specific example. 

Example 10.1.1. Consider the following vector field on 



X = fix — y — x{x^ + y'^), 
y = x + yy-y{x^ + y^), 



(x,y) e R^, 



(10.1.3) 



where /r € R^ is a parameter. Our goal is to study (10.1.3) by constructing 
an associated one-dimensional Poincare map and studying the dynamics of the 
map. According to our previous discussion, we need to find a periodic orbit of 
(10.1.3), construct a cross-section to the orbit, and then study how points on 
the cross-section return to the cross-section under the flow generated by (10.1.3). 
Considering (10.1.3) and thinking about how to carry out these steps should bring 
home the point stated at the beginning of this section — constructing a Poincare 
map requires some knowledge of the geometry of the flow generated by (10.1.3). 
In this example the procedure is greatly facilitated by considering the vector field 
in a “more appropriate” coordinate system; in this case, polar coordinates. 

Let 



X = rcosO, 
y = r sinS; 



(10.1.4) 



then (10.1.3) becomes 



3 

r = yr — r , 
6 = 1 . 



(10.1.5) 



We will require /r > 0, in which case the flow generated by (10.1.5) is given by 



cl)t{ro,do) 




( 10 . 1 . 6 ) 



It should be clear that (10.1.5) has a periodic orbit given by (j>t{y/Jl,9o). We now 
construct a Poincare map near this periodic orbit. 

We define a cross-section S to the vector field (10.1.5) by 

E = { (r, 6») € R X 5^ I r > 0, 9 = 6io}. 



(10.1.7) 
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The reader should verify that S is indeed a cross-section. From (10.1.5) we see 
that the “time of flight” for orbits starting on S to return to S is given hy t — 2 tt. 
Using this information, the Poincare map is given by 



P: E S 

(ro, 9o) I— >• 027r(ro, 6^o) — ® , do + 2 -k^ , ^ ^ 



or simply 




(10.1.9) 



where we have dropped the subscript ‘0’ on r for notational convenience. The 
Poincare map has a fixed point at r = ,JJi. We can compute the stability of 
the fixed point by computing the eigenvalue (which is just the derivative for a 
one-dimensional map) of DP{y/JI). A simple calculation gives 



PP(V/f) = 



( 10 . 1 . 10 ) 



Therefore, the fixed point r = is asymptotically stable. 

Before leaving this example there are several points to make. 

1. Viewing (10.1.3) in the correct coordinate system was the key to this prob- 
lem. This made the choice of a cross-section virtually obvious and provided 
“nice” coordinates on the cross-section (i.e., r and 6 “decoupled” as well). 
Later we will learn a general technique called normal form theory which 
can be used to transform vector fields into the “nicest possible” coordinate 
systems. 

2. We know that the fixed point of P corresponds to a periodic orbit of (10.1.5) 
and that the fixed point of P is asymptotically stable. Does this imply that 
the corresponding periodic orbit of (10.1.5) is also asymptotically stable? 
It does, but we have not proved it yet (note: the reader should think about 
this in the context of this example until it feels “obvious” ) . We will consider 
this point when we consider how the Poincare map changes when the cross- 
section is varied. 



End of Example 10.1.1 

Before leaving Case 1, let us illustrate how the study of Poincare maps 
near periodic orbits may simplify the geometry. 

Consider a vector field in generating a flow given by x G 

Suppose also that it has a periodic orbit, 7 , of period T > 0 passing through 
the point Xq G i.e., 

M^o) = <Pt+T{xo). 

We construct in the usual way a Poincare map, P, near this periodic orbit 
by constructing a cross-section, S, to the vector held through xq and con- 
sidering the return of points to S under the flow generated by the vector 
held; see Figure 10.1.1. 
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FIGURE 10.1.2. The geometry of the Poincare map 

Now consider the Poincare map P. The map has a fixed point at xq. 
Suppose that the fixed point is of saddle type having a one-dimensional 
stable manifold, IT®(a;o), and a one-dimensional unstable manifold IU“(xo); 
see Figure 10.1.2. We now want to show how these manifolds are manifested 
in the flow and how they are related to 7 . Very simply, using them as 
initial conditions, they generate the two-dimensional stable and unstable 
manifolds of 7 . Mathematically, this is represented as follows 




FIGURE 10.1.3. 



w^(7) = U MwrM), 

t<0 

w^“(7) = U 

t>0 
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It should be clear that IP® (7) (resp. IP“(7)) is just as differentiable as 
IPj®c(^o) (resp. W^^{xo)), since is differentiable with respect to x] 

see Figure 10.1.3 for an illustration of the geometry. Hence, in IP®(7) 
and IP“(7) are two two-dimensional surfaces which intersect in the closed 
curve. This should serve to show that it is somewhat simpler geometrically 
to study periodic orbits and their associated stable and unstable manifolds 
by studying the associated Poincare map. 

We now turn to Case 2. 



10.2 Case 2: The Poincare Map of a Time-Periodic 
Ordinary Differential Equation 



Consider the following ordinary differential equation 

x = f{x,t), x G K", (10.2.1) 

where f:U^ K" is C’’ on some open set U C M" x Suppose the time 
dependence of (10.2.1) is periodic with fixed period T = 2tt/lo > 0, i.e., 
f{x,t) = f{x,t + T). We rewrite (10.2.1) in the form of an autonomous 
equation in n -I- 1 dimensions (see Chapter 7) by defining the function 



9 : ^ 

1 1 -^ 9{t) = ojt, mod 27 T. 



( 10 . 2 . 2 ) 



Using (10.2.2) equation (10.2.1) becomes 

X = fix, 9), 0) G R” X M. (10.2.3) 



We denote the fiow generated by (10.2.3) by (j>{t) = (x(t),0(t) = ujt + 
9o (mod27r)). We define a cross-section to the vector field (10.2.3) by 

={(x,9) X \ 9 = 9 qG (0,27t]}. (10.2.4) 

The unit normal to in R” x is given by the vector (0, 1), and it is 
clear that is transverse to the vector field (10.2.3) for all x G R", since 
[f{x,9),uj) • (0, 1) = w yf 0. In this case is called a global cross-section. 
We define the Poincare map of as follows: 








— 9n 27T 



9q 2tt = 9q , 



f 9o — 9o\ f 9o — 9 q 2tt 



or 



X 



(10.2.5) 
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Thus, the Poincare map merely tracks initial conditions in a; at a fixed 
phase after successive periods of the vector field. 

It should be clear that fixed points Pg^ correspond to 27r/w-periodic 
orbits of (10.2.1) and fc-periodic points of Pg^ correspond to periodic orbits 
of (10.2.1) that pierce k times before closing. We will worry about the 
effect on the dynamics of the map caused by changing the cross-section 
later. Now we consider an example. 



10.2a Periodically Forced Linear Oscillators 

Consider the following ordinary differential equation 

X + 5x + ujqX = jcosLot. (10.2.6) 

This is an equation which most students learn to solve in elementary cal- 
culus courses. Our goal here is to study the nature of solutions of (1.2.17) 
from our more geometrical setting in the context of Poincare maps. This 
will enable the reader to obtain a new point of view on something relatively 
familiar and, we hope, to see the value of this new point of view. 

We begin by first obtaining the solution of (10.2.6). Recall (see, e.g., 
Arnold [1973] or Hirsch and Smale [1974]) that the general solution of 
(1.2.17) is the sum of the solution of the homogeneous equation (i.e., the 
solution for 7 = 0), sometimes called the free oscillation, and a particular 
solution, sometimes called the forced oscillation. For <5 > 0 there are several 
possibilities for the homogeneous solution, which we state below. 

5 > 0: The Homogeneous Solution, Xh(t) 

There are three cases depending on the sign of the quantity — dwg . 

(a) (5^ - > 0 ^ Xh{t) = C'le’’!* -k C'ze’’"*, (10.2.7) 

where 

n^2 = -S/2±{l/2)^S^-4u;l 

{b) (5^ - 4wg = 0 ^ Xh{t) = (Cl -I- 

(c) 6^ — 4wg < 0 Xh(t) = cos wt -I- C2 sinOt), 

and where to = (1/2)a/4wq — In all three cases Ci and C2 are unknown 
constants which are fixed when initial conditions are specified. Also, no- 
tice that in all three cases lim Xh{t) = 0. We now turn to the particular 

t—¥(X) 

solution. 
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The Particular Solution, Xp { t ) 

The particular solution is given by 

Xp { t ) = A cos U!t + B sin cut, (10.2.8) 

where 

_ (^0 - _ Sjw 

~ (w§ _ ^2)2 + (J^)2 ’ “ (wg - u;2)2 + (Sujy ■ 

Next we turn to the construction of the Poincare map. For this we will 
consider only the case — 4wq < 0. The other two cases are similar, and 
we leave them as exercises for the reader. 

The Poincare Map; <5^ - 4w§ < 0 
Rewriting (10.2.6) as a system, we obtain 
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We construct a cross-section at = 0 (note: this is why we specified the 
initial conditions at t = 0) as follows 

S° = S = {{x,y,9) G Ri X Ri X S'M = 0 G [0,27 t)}, (10.2.14) 

where we have dropped the subscript “0” on x, y, and 9 for notational 
convenience. Using (10.2.12), the Poincare map is given by 



P: S 



where 




2 



C- 45 S 



e-W- [_aC+{-^A-^B] S 



^ — STrfu 



—toBC 







C = cos27t — , 

UJ 



(10.2.15) 



S = sin 27 t— . 

UJ 

Equation (10.2.15) is an example of an affine map, i.e., it is linear map 
plus a translation. 

The Poincare map has a single fixed point given by 

{x,y) = {A,ujB) (10.2.16) 

(note: this should not be surprising). The next question is whether or not 
the fixed point is stable. A simple calculation shows that the eigenvalues 
of DP{A, ujB) are given by 

Ai,2 = (10.2.17) 

Thus the fixed point is asymptotically stable with nearby orbits appear- 
ing as in Figure 10.2.1. (Note: the “spiraling” of orbits near the fixed point 
is due to the imaginary part of the eigenvalues.) Figure 10.2.1 is drawn for 
A > 0; see (10.2.8). 



The Case of Resonance: lo = oj 

We now consider the situation where the driving frequency is equal to the 
frequency of the free oscillation. In this case the the Poincare map becomes 



P: S 



!->■ e 



— S-KjuJ 




f A{l-e~^^/^) \ 
\ujB{l-e-^^/‘^) )' 



(10.2.18) 
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FIGURE 10.2.1. 

This map has a unique fixed point at 

{x,y) = {A,ujB). (10.2.19) 

The eigenvalues of DP{A,coB) are identical and are equal to 

A = 6"'^’"/“. (10.2.20) 

Thus, the fixed point is asymptotically stable with nearby orbits appearing 
as in Figure 10.2.2. (Note: in this case orbits do not spiral near the fixed 
point since the eigenvalues are purely real.) 

For 5 > 0, in all cases the free oscillation dies out and we are left with 
the forced oscillation of frequency w which is represented as an attracting 
fixed point of the Poincare map. We will now examine what happens for 
5 = 0. 

5 = 0: Subharmonics, Ultraharmonics, and Ultrasubharmonics 
In this case the equation becomes 



x = y, 

y = —u!qX + 7 cos 9, 
9 = uj. 



( 10 . 2 . 21 ) 



Using (10.2.7c) and (10.2.8), we see that the general solution of (10.2.21) 
is given by 



x{t) = Cl cos u>ot + C 2 sin ojQt + A cos ivt, 
ylt) = x(t), 



( 10 . 2 . 22 ) 



1 - ^ 

_ ^2 ’ 



where 



(10.2.23) 
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FIGURE 10.2.2. 



and Cl and C 2 are found by solving 



x(0) = xo = Cl + A, 
2/(0) = 2/0 = C2Wo- 



(10.2.24) 



It should be evident that, for now, we must require w yf wq in order for 

(10.2.22) to be valid. 

Before writing down the Poincare map, there is an important distinction 
to draw between the cases S > 0 and (5 = 0. As mentioned above, for <5 > 0, 
the free oscillation eventually dies out leaving only the forced oscillation 
of frequency uj. This corresponds to the associated Poincare map having a 
single asymptotically stable fixed point. In the case i5 = 0, by examining 

(10.2.22) , we see that this does not happen. In general, for <5 = 0, it should 
be clear that the solution is a superposition of solutions of frequencies w 
and ojo- The situation breaks down into several cases depending on the 
relationship of w to ojq. We will first write down the Poincare map and 
then consider each case individually. 

The Poincare map is given by 



P: S ^ S, 

/ x\ ( cos 27 t^ -f sin 27 T^\ 

\2/ j ^ V-wosin 27 r^ cos 27 t^ )\y) 



(10.2.25) 



/ A( 1 -cos 27 t^)\ 

woAsin27T^ J 

Our goal is to study the orbits of P. As mentioned above, this will depend 
on the relationship of w and loq- We begin with the simplest case. 



Harmonic Response 
Consider the point 



{x,y) = (A,0). 



(10.2.26) 
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It is easy to verify that this is a fixed point of P corresponding to a solution 
of (10.2.21) having frequency co. 

We now want to describe a somewhat more geometrical way of viewing 
this solution which will be useful later on. Using (10.2.26), (10.2.24) and 
(10.2.22), the fixed point (10.2.26) corresponds to the solution 



x{t) = A cos ojt, 
y{t) = —Aijjsviiujt. 



(10.2.27) 



If we view this solution in the x-y plane, it traces out a circle which 
closes after time 2-k juj. 

If we view this solution in the x-y-9 phase space, it traces out a spiral which 
can be viewed as lying on the surface of a cylinder. 

The cylinder can be thought of as an extension of the circle traced out by 
(10.2.27) in the x-y plane into the 0-direction. Since 9 is periodic, the ends 
of the cylinder are joined to become a torus, and the trajectory traces out 
a curve on the surface of the torus which makes one complete revolution 
on the torus before closing. The torus can be parameterized by two angles; 
the angle 9 is the longitudinal angle. We will call the latitudinal angle 9q, 
which is the angle through which the circular trajectory turns in the x-y 
plane. This situation is depicted geometrically in Figure 10.2.3. 

Trajectories which wind many times around the torus may be somewhat 
difficult to draw, as in Figure 10.2.3; we now want to show an easier way to 
represent the same information. First, we cut open the torus and identify 
the two ends as shown in Figure 10.2.4. Then we cut along the longitudinal 
angle 9 and fiatten it out into a square as shown in Figure 10.2.5. This 
square is really a torus if we identify the two vertical sides and the two 
horizontal sides. 

This means that a trajectory that runs off the top of the square reappears 
at the bottom of the square at the same 9 value where it intersected the 
top edge. For a more detailed description of trajectories on a torus, see 
Abraham and Shaw [1984]. We stress that this construction works because 
all trajectories of (10.2.21) lie on circles in the x-y plane. Motion on tori is 
a characteristic of multifrequency systems. 



Subharmonic Response of Order m 
Suppose we have 

u! = mu)o, m > 1, (10.2.28) 

where m is an integer. Consider all points on S except (x,y) = (A, 0) 
(we already know about this point). Using (10.2.22) and the expression 
for the Poincare map given in (10.2.25), it is easy to see that all points 
except (x,y) = (A, 0) are period m points, i.e., they are fixed points of the 
m**' iterate of the Poincare map (note: this statement assumes that by the 
phrase “period of a point” we mean the smallest possible period). Let us 
now see what they correspond to in terms of motion on the torus. 
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FIGURE 10.2.3. 




e 



FIGURE 10.2.4. 

Using (10.2.28) and (10.2.22) it should be clear that x{t) and y{t) have 
frequency uj/m. Thus, after a time t = 27r/w, the solution has turned 
through an angle 27 t/to in the x-y plane, i.e., 9q has changed by 2T:lm. 
Therefore, the solution makes m longitudinal circuits and one latitudinal 
circuit around the torus before closing up. The m distinct points of inter- 
section that the trajectory makes with 6* = 0 are all period m points of 
P, or equivalently, fixed points of the iterate of P. Such solutions are 
called subharmonics of order m. In Figure 10.2.6 we show examples for 
m = 2 and m = 3. 

Ultraharmonic Response of Order n 

Suppose we have 



nuj = ojQ, 



n > 1, 



(10.2.29) 
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FIGURE 10.2.5. 
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m = 3 

FIGURE 10.2.6. 

where n is an integer. Consider all points on S except {x, y) = {A, 0). Using 
(10.2.29) and (10.2.22) it is easy to see that every point is a fixed point of 
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the Poincare map. Let us see what this corresponds to in terms of motion 
on the torus. 



t 

»0 

I 




■^0 
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FIGURE 10.2.7. 

Using (10.2.22) and (10.2.29), we see that x{t) and y{t) have frequency 
mo. This means that after a time t = 27r/w, the solution has turned through 
an angle 27rn in the x-y plane before closing up. Since 27rn = 27 t (mod27r), 
this explains the nature of these fixed points of P: they correspond to 
solutions which make n latitudinal and one longitudinal circuits around the 
torus before closing up. We illustrate the situation geometrically for n = 2 
and n = 3 in Figure 10.2.7. Such solutions are referred to as ultraharmonics 
of order n. 

Ultrasubharmonic Response of Order m, n 
Suppose we have 

nuj = mjjjQ, TO, n > 1, (10.2.30) 

where to and n are relatively prime integers, which means that all com- 
mon factors of n/m have been divided out. Using exactly the same argu- 
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merits as those given above, it is easy to show that all points in S except 
{x, y) = {A, 0) are period m points which correspond to trajectories making 
m longitudinal and n latitudinal circuits around the torus before closing 
up. These solutions are referred to as ultrasubharmonics of order m, n. We 
illustrate the situation for (n,m) = (2,3) and (n,m) = (3,2) in Figure 
10 . 2 . 8 . 




0 n/2 n 3ti/2 2n 

(n,m) = (3,2) 



FIGURE 10.2.8. 



Quasiperiodic Response 
For the final case, suppose we have 

— = irrational number. (10.2.31) 

UJo 

Then for all points in S except {x, y) = {A, 0), the orbit of the point densely 
fills out a circle on S which corresponds to an invariant two-torus in x-y-6 
space. We will prove this rigorously in Section 10.4a. 






138 



10. Poincare Maps 



10.3 Case 3: The Poincare Map Near a Homoclinic 
Orbit 



We now want to give an example of the construction of a Poincare map in 
the neighborhood of a homoclinic orbit. Rather than getting entangled in 
technical details, we will concentrate on a specific example in two dimen- 
sions which illustrates the main ideas. 

Consider the ordinary differential equation 



X = ax + fi{x,y; y), 
y = fiy + f2{x,y, Ii), 



{x,y,fj,) G X X 



(10.3.1) 



with /i, /2 = 0{\x\'^ + |yp) and C’’, r >2 and where y, is regarded as a 
parameter. We make the following hypotheses on (10.3.1). 




FIGURE 10.3.1. Behavior of the homoclinic orbit as y is varied. 
Hypothesis 1. a < 0, /3 > 0, and a + j3 ^ Q. 

Hypothesis 2. At /i = 0 (10.3.1) possesses a homoclinic orbit connecting the 
hyperbolic fixed point (x,y) = (0,0) to itself, and on both sides of y = 0 
the homoclinic orbit is broken. Furthermore, the homoclinic orbit breaks 
in a transverse manner in the sense that the stable and unstable manifolds 
have different orientations on different sides of /i = 0. For definiteness, we 
will assume that, for /x < 0, the stable manifold lies inside the unstable 
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manifold, for /i > 0, the stable manifold lies outside the stable manifold 
and, for /r = 0, they coincide; see Figure 10.3.1. 

Hypothesis 1 is of a local nature, since it concerns the nature of the 
eigenvalues of the vector field linearized about the fixed point. Hypothesis 
2 is global in nature, since it supposes the existence of a homoclinic orbit 
and describes the nature of the parameter dependence of the homoclinic 
orbit . 

Now an obvious question is why this scenario? Why not stable inside 
for /i > 0 and unstable inside for /r < 0? Certainly this could happen; 
however, this is not important for us to consider at the moment. We need 
to know only that, on one side of /i = 0, the stable manifold lies inside the 
unstable manifold, and on the other side of /i = 0, the unstable manifold 
lies inside the stable manifold. Of course, in applications, we would want to 
determine which case actually occurs, and later on we will learn a method 
for doing this (Melnikov’s method); however, now we will simply study the 
consequences of a homoclinic orbit to a hyperbolic fixed point of a planar 
vector field breaking in the manner described above. 

Let us remark that it is certainly possible for the eigenvalues a and (3 
to depend on the parameter /x. However, this will be of no consequence 
provided that Hypothesis 1 is satisfied for each parameter value and that 
this is true for ^ sufficiently close to zero. 

The question we ask is the following: What is the nature of the orbit 
structure near the homoclinic orbit for ^ near /x = 0? We will answer 
this question by computing a Poincare map near the homoclinic orbit and 
studying the orbit structure of the Poincare map. The Poincare map that 
we construct will be very different from those we constructed in Cases 1 
and 2 in that it will be the composition of two maps. One of the maps, Pq, 
will be constructed from the flow near the origin (which we will take to be 
the flow generated by the linearization of (10.3.1) about the origin). The 
other map. Pi, will be constructed from the flow outside of a neighbor- 
hood of the fixed point, which, if we remain close enough to the homoclinic 
orbit, can be made to be as close to a rigid motion as we like. The result- 
ing Poincare map, P, will then be given by P = Pi o Pq. Evidently, with 
these approximations, our Poincare map will be valid (meaning that its dy- 
namics reflect the dynamics of (10.3.1)) only when it is defined sufficiently 
close to the (broken) homoclinic orbit. We will discuss the validity of our 
approximations later on, but for now we begin our analysis. 

The analysis will proceed in several steps. 

Step 1. Set up the domain for the Poincare map. 

Step 2. Compute Pq. 

Step 3. Compute Pi. 

Step 4. Examine the dynamics of P = Pi o Pq. 
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Step 1: Set Up the Domain for the Poincare Map. For the domain of Pq we 
choose 

50 = { (a;, y) G I a; = e > 0, y > 0}, (10.3.2) 

and for the domain Pi we choose 

51 = { (a;, y) G I a; > 0, y = e > 0}. (10.3.3) 

We will take e small; the need for this will become apparent later on. See 
Figure 10.3.2 for an illustration of the geometry of Sq and Si. 




FIGURE 10.3.2. 



Step 2: Compute Pq. We will use the flow generated by the linear vector 
held 

(10.3.4) 

V = Py, 

in order to compute the map, of points on Sq to Si. For this to be a 
good approximation, it should be clear that we must take e and y small. 
We will discuss the validity of this approximation later. 

The flow generated by (10.3.4) is given by 



x{t) = xoe“*, 
y{t) = yoe^K 



(10.3.5) 



The time of flight, T, needed for a point (e, j/o) G Sq to reach Si under 
the action of (10.3.5) is given by solving 



e = 2/oe 



f3T 



(10.3.6) 



1 £ 
T=-log~. 
P Vo 



to obtain 



(10.3.7) 
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From (10.3.7) it is clear that we must require t/o ^ £• 



-fo- 51o Si, 
(s,yo) ^ (e 





(10.3.8) 



Step 3: Compute P\. Using Theorem 7.1.1, by smoothness of the flow with 
respect to initial conditions and the fact that it only takes a finite time to 
flow from Si to Sq along the homoclinic orbit, we can And a neighborhood 
[/ C Si which is mapped onto Sq under the flow generated by (10.3.4). We 
denote this map by 

Pi{x,£]pl) = {Pii{x,£-,ij),Pi2{x,£-,pi ))\ [/ C Si -)> So, (10.3.9) 

where Pi(0,£;0) = (£,0). Taylor expanding (10.3.9) about {x,£]fi) = 
(0, £; 0) gives 

Pi{x, £; p,) = {e, ax + bp) + 0{2). (10.3.10) 

The expression “0(2)” in (10.3.10) represents higher order nonlinear terms 
which can be made small by taking £, x, and p small. For now, we will 
neglect these terms and take as our map 



Pp. [/ C Si — >■ So, 

(x, £) I-T (£, ax + bp), 



(10.3.11) 



where a > 0 and 6 > 0. The reader should study Figure 10.3.1 to determine 
why we must have a, b > 0. 

Step 4: Examine the Dynamics of P = P\ o Pq. We have 



P — P\o Pq: U C So — >■ So, 



(e,2/o) 



^£, as 




where V = (Pq) ^{U), or 

P{y,p):y^Ay^'^/^^+bp, 



(10.3.12) 



(10.3.13) 



where A = > 0 (we have left the subscript “0” off the yo for the 

sake of a less cumbersome notation). (Note: of course, we are assuming also 
that U is sufficiently small so that (Po)~^{U) C So.) 

Let 6 = |a//3|; then a + P ^ 0 implies 5^1. We will seek fixed points of 
the Poincare map, i.e., y £ V such that 



P{y\h) = Ay^ + bp = y. (10.3.14) 



The fixed points can be displayed graphically as the intersection of the 
graph of P{y, p) with the line y = P{y, p) for fixed p. 

There are two distinct cases. 




142 



10. Poincare Maps 




FIGURE 10.3.3. Graph of P for /i > 0, /i = 0, and fi < 0 with <5 > 1. 




fl<0 




FIGURE 10.3.4. Phase plane of (10.3.1) for 5 > 1. 

Case 1: |a| > \(3\ or 5 > 1 

For this case DyP{0; 0) = 0, and the graph of P appears as in Figure 10.3.3 
for ^ > 0, /r = 0, and /i < 0. Thus, for /x > 0 and small /x, (10.3.13) has a 
fixed point. 

The fixed point is stable and hyperbolic, since 0 < DyP < 1 for /x suffi- 
ciently small. By construction we therefore see that this fixed point cor- 
responds to an attracting periodic orbit of (10.3.1) (provided that we can 
justify our approximations); see Figure 10.3.4. We remark that if the homo- 
clinic orbit were to break in the manner opposite to that shown in Figure 
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10.3.1, then the fixed point of (10.3.13) would occur for /r < 0. 




FIGURE 10.3.5. Graph of P for /i > 0, /r = 0, and /r < 0 with 5 < 1. 




n<o 




n-o 




n>o 



FIGURE 10.3.6. Phase plane of (1.2.50) for 5 < 1. 



Case 2: |o;| < |/3| or S < 1 

For this case, DyP{0; 0) = oo, and the graph of P appears as in Figure 
10.3.5. Thus, for < 0, (10.3.13) has a repelling fixed point. By construc- 
tion we can therefore conclude that this corresponds to a repelling periodic 
orbit for (10.3.1); see Figure 10.3.6. We remark that if the homoclinic orbit 
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were to break in the manner opposite to that shown in Figure 10.3.1, then 
the fixed point of (10.3.13) would occur for /r > 0. 

We summarize our results in the following theorem. 

Theorem 10.3.1 Consider a system where Hypothesis 1 and Hypothesis 
2 hold. Then we have, for pL sufficiently small: i) If a + P < 0, there exists 
a unique stable periodic orbit on one side of p, = 0; on the opposite side of 
p there are no periodic orbits, ii) If a + P > 0, the same conclusion holds 
as in i), except that the periodic orbit is unstable. 

We remark that if the homoclinic orbit breaks in the manner opposite 
that shown in Figure 10.3.1, then Theorem 10.3.1 still holds except that the 
periodic orbits occur for p values having the opposite sign as those given 
in Theorem 10.3.1. Theorem 10.3.1 is a classical result which can be found 
in Andronov et al. [1971]. Additional proofs can be found in Guckenheimer 
and Holmes [1983] and Chow and Hale [1982]. 

Before leaving this example we must address an important point, which is 
that we have not rigorously proven Theorem 10.3.1, since the Poincare map 
we computed was only an approximation. We must therefore show that the 
dynamics of the exact Poincare map are contained in the dynamics of the 
approximate Poincare map. Because our main goal is to demonstrate how 
to construct a Poincare map near a homoclinic orbit, we refer the reader 
to Wiggins [1988] and Bakaleinikov and Silbergleit [1995a, b] for the proof 
of this fact under the condition that we remain sufficiently close to the 
(broken) homoclinic orbit, i.e., for e and p sufficiently small. 



10.4 Case 4: Poincare Map Associated with a Two 
Degree-of-Freedom Hamiltonian System 



The study of two degree-of-freedom Hamiltonian systems can often be re- 
duced to the study of two-dimensional maps, although this reduction is 
typically local in the phase space. Here we describe how this can be done. 

Consider a two degree-of-freedom Hamiltonian system with Hamiltonian 
given by 

H{xi,X2,yi,y2), (a;i, X2, 2/1, 2/2) G 

where Xi — yi,i= 1, 2, are the canonically conjugate pairs of variables. The 
level set of the Hamiltonian, or energy surface, i.e., 

H{xi,X2,yi,y2) = h, (10.4.1) 



is typically three-dimensional. In particular, if 



dH 

dy2 



7 ^ 0 : 



(10.4.2) 
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then it is three-dimensional and, by the implicit function theorem, on a 
fixed energy surface we can solve for j /2 as a function of xi, X 2 , yi, and h, 
i.e., 

V 2 = y 2 {xi,X 2 ,yi,h). (10.4.3) 

Hence, locally the level set of the Hamiltonian is represented by the graph 
of this function. 

Now consider the three-dimensional hyperplane given by X 2 = constant. 
Then a vector normal to this hyperplane is given by 

iV=(0,l,0,0), 

and a vector normal to the energy surface is given by 

^ m dH \ 

V9a;i’ dx2^ dyi dy2j ' 

Hence, we see that (10.4.2) implies that VH and N cannot be parallel, and 
therefore the three-dimensional hyperplane X 2 = constant intersects the 
three-dimensional energy surface transversely in a two-dimensional surface, 
which we denote by S. S, or some subset of it, is our candidate for the 
Poincare section. Since the energy surface is locally represented by the 
graph of (10.4.3), we can take x\ and yi as coordinates on S. Let S C S 
denote the subset of S for which N ■ VH is nonzero and of one sign, and 
trajectories with initial conditions on S return to S. This latter condition 
is the most problematic. It is certainly true if X 2 is an angular variable, and 
it is reasonable to expect it to be true if the energy surface is bounded, but 
insuring that this condition holds is part of the art of constructing Poincare 
sections. Once all these conditions are satisfied the Poincare map is given 
by 

P : S^S, 

(a;i(0),yi(0)) (a;i(r),?/i(T)) , (10.4.4) 

where r = r (xi(0), yi(0); h) is the time for a trajectory with initial condi- 
tion (xi(0), j/i(0)) on S to return to S. 

Let us carry out this procedure for the two-degree-of-freedom Hamilto- 
nian system given in Example 5.2b, where all the calculations can be done 
explicitly. From (5.2.1), the equations restricted to the energy surface are 
given by 

A = 0 , 

61 = Oi, 

62 = O 2 , 

with the energy surface given by the graph of the function 
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The Poincare section is parametrized by the coordinates I\ — 9i and, since 
02 is an angular coordinate, all points on S return to S after time 
Hence the Poincare map is given by 

P : S ^ S, 

{Ii,9i) !->• + 2tt—). 

Therefore S is foliated by invariant circles given by I\ = constant and the 
dynamics on each invariant circle is given by i— >■ + 27 t^. 



10.4a The Study of Coupled Oscillators via 
Circle Maps 

In both Section 10.4 and Section 5.2b we saw that (for C, I 2 ^ 0) the study 
of two linearly coupled, linear undamped oscillators in a four-dimensional 
phase space could be reduced to the study of the following two-dimensional 
vector field 

{9i,02) & X S\ (10.4.5) 

The flow generated by (10.4.5) is defined on the two-torus, x = T^, 
and 6*1 and 6*2 are called the longitude and latitude; see Figure 10.4.1. As in 
Example 10.2a, it is often easier to visualize flows on tori by cutting open 
the torus, flattening it out, and identifying horizontal and vertical sides 
of the resulting square as shown in Figure 10.4.2. The flow generated by 
(10.4.5) is simple to compute and is given by 



9i{t) — ui\t + 6 * 10 , 
6*2(t) = UJ2t + 020, 



(mod 27 t). 



(10.4.6) 



However, orbits under this flow will depend on how oji and 0 J 2 are related. 




FIGURE 10.4.1. 
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FIGURE 10.4.2. 

Definition 10.4.1 and u )2 are said to he incommensurate if the equa- 
tion 

muji + nuj2 = 0 

has no solutions eonsisting of n, m G Z (integers). Otherwise, uji and UJ 2 
are commensurate. 

Theorem 10.4.2 If toi and ui 2 are commensurate, then every phase curve 
of ( 10 . 4 . 5 ) is closed. However, ifui\ anduj 2 are incommensurate, then every 
phase curve of (10.4.5) is everywhere dense on the torus. 

To prove this theorem, we need the following lemma. 

Lemma 10.4.3 Suppose the circle is rotated through an angle a, and 
a is incommensurate with 2 tt. Then the sequence 



S = {9,9 a,9 2a, ■■■ ,9 na, ■ ■ ■ , (mod 27 t)} 

is everywhere dense on the circle (note: n is an integer). 

Proof: 

{ 9 + ma if ma — 2 tt < 0, 

9 + {ma — 2Trk) if ma — 2nk > 0,k > 1 

and ma — 2Tr{k + 1) < 0, 

so, in particular, since a and 27t are incommensurate, the sequence S is 
infinite and never repeats. 

We will use the “pigeonhole principle,” i.e., if you have n holes and n + 1 
pigeons, then one hole must contain at least two pigeons. 

Divide the circle into k half-open intervals of equal length 2Tr/k. Then, 
among the first fc -I- 1 elements of the sequence S, at least two must be in 
the same half-open interval; call these points 9 pa, 9 ga(mod 27t) with 
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p > q. Thus, (p — q)a = sa < 27r/fc(mod 27 t). Any two consecutive points 
of the sequence S given by 



S = {9,0 + sa, 9 + 2sa, ■ ■ ■ ,9 + nsa, ■ ■ ■ , (mod 27 t)} 

are therefore the same distance d apart, where d < 2 tt jk (note that S C S'). 

Now choose any point on S^ and construct an £-neighborhood around it. 
If k is chosen such that 2Tr/k < e, then at least one of the elements of S 
will lie in the e-neighborhood. This proves the lemma. □ 



Now we prove Theorem 10.4.2. 

Proof: First, suppose lo\ and W 2 are commensurate, i.e., 3 n, m G Z such 
that = {nfm)uj 2 - We construct a Poincare map as follows. Let the cross- 
section S be defined as 



S"i« = {(0i,02) |0i = 0io}. 
Then, using (10.4.7), we have 

Pg,,: ^ 

02 02 UJ2 . 



(10.4.7) 



(10.4.8) 



However, u; 2 /wi = m/n; hence, we have 

771 

02 '->■ ^^2 + 27t — ( mod27r). (10.4.9) 

n 

This is a map of the circle onto itself (called a circle map); the number 
W 2 /W 1 is called the rotation number. (Rotation numbers are also defined 
for nonlinear circle maps, as we shall see later.) 

It is clear that the iterate of this map is given by 

02 02 + 27rm(mod27r) = 02- (10.4.10) 

Thus, every 02 is a periodic point; hence the flow consists entirely of closed 
orbits. This proves the first part of the theorem. 

Now suppose u>i and u >2 are incommensurate; then 102 ! = a, where a 
is irrational. The Poincare map is then given by 

02 I— >■ 02 + 27TQ;(mod 27 t); (10.4.11) 

thus, by Lemma 10.4.3, the orbit of any point 02 is dense in the circle. 

Next choose any point p on and construct an e-neighborhood of p. To 
finish the proof of Theorem 10.4.2 we need to show that, given any orbit 
on T^, it eventually passes through this e-neighborhood of p. This is done 
as follows. 




10.5 Exercises 



149 



First, we are able to construct a new cross-section which passes 
through the ^-neighborhood of p; see Figure 10.4.3. We have seen that the 
orbits of P^io- — >■ are all dense on for any 0iq. Therefore, we 

can take any point on and look at its first intersection point with 
under the flow (10.4.6). From this it follows that the iterates of this point 
under are dense in This completes the proof. □ 




FIGURE 10.4.3. 



Let us make a final remark before leaving this example. In our intro- 
ductory motivational remarks we stated that Poincare maps allow a di- 
mensional reduction of the problem by at least one. In this example, we 
have seen how the study of a four-dimensional system can be reduced to 
the study of a one-dimensional system. This was possible because of our 
understanding of the geometry of the phase space; i.e., the phase space was 
made up of families of two-tori. It will be a common theme throughout this 
book that a good qualitative feel for the geometry of the phase space will 
put us in the best position for quantitative analysis. 

Finally, we note that these results for linear vector fields on actually 
remain true for nonlinear differentiable vector fields on T^, namely, that 
the (V limit sets for vector fields with no singular points are either closed 
orbits or the entire torus; see Hale [1980]. 



10.5 Exercises 



Exercises 1 — 3 are concerned with the properties of the following Poincare map. 
Consider the Poincare map given in (10.2.15) 
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/e-^"/“[-AC + - %B)S] + A 



U) UJ 

C — cos 2tz — , S — sin 27 t — , 

UJ UJ 

^ Sjuj 

’ (o;§ - (^2)2 + (5^)2 ’ (o;2-a;2)2 + (5i^)2- 

1. Show that {x,y) — {A^ujB) is the only fixed point for this map and, hence, argue that it 

is a global attractor. 

2. Discuss the nature of the orbit structure near (a;, y) = {A, ujB) for different values of uj juj. 

3. Show how the fixed point of the Poincare map changes as the cross-section (i.e., phase 

angle) is varied. 

4. Construct and study the Poincare map for 

X - y, 

y = —uj'^x 7 cos 9, 

9 — ujQ. 



where 



and 




Exercises 5—9 are concerned with the properties of the following Poincare map. 

Consider the Poincare map {uj ^ cjq) given in (10.2.25) 

/ cos 2-k ^ ^ sin 2-k ^ \ / a; \ 

V— cJosin27r^ cos27t^ ) \V ) 

/ 24(1 — cos 27 t^) \ 

\^aJo24sin27T^ j 

5. For UJ — mujQ, m > 1, show that all points except (a;, y) — (A, 0) are period m points. 

6. For nuj — ujq, n > 1, show that all points are fixed points. 

7 . Discuss the orbit structure of the Poincare map when ujfujQ is an irrational number. 

8. Discuss stability of the harmonics, subharmonics, ultraharmonics, and ultrasubharmonics. 

9. Recall that a map, /, preserves orientation when det D f > 0. Show that the four types 

of Poincare maps discussed in this section preserve orientation. What would be the 
consequences if they did not? 

10 . Consider an n degree-of-freedom Hamiltonian system. Following the discussion in Section 

10.4, discuss how it can be studied via a 2n — 2 dimensional Poincare map. 

11 . Show that the Poincare maps for Hamiltonian systems described in Section 10.4 and 

Exercise 10 preserve volume. 
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Conjugacies of Maps, and 
Varying the Cross-Section 



We now turn to answering the question of how the choice of cross-section 
affects the Poincare map. The point of view that we develop will be that 
Poincare maps defined on different cross-sections are related by a (in gen- 
eral, nonlinear) coordinate transformation. The importance of coordinate 
transformations in the study of dynamical systems cannot be overesti- 
mated. For example, in the study of systems of linear constant coefficient 
ordinary differential equations, coordinate transformations allow one to de- 
couple the system and hence reduce the system to a set of decoupled linear 
first-order equations which are easily solved. In the study of completely 
integrable Hamiltonian systems, the transformations to action-angle coor- 
dinates results in a trivially solvable system (see Arnold [1978]), and these 
coordinates are also useful in the study of near integrable systems. If we 
consider general properties of dynamical systems, coordinate transforma- 
tions provide us with a way of classifying dynamical systems according to 
properties which remain unchanged after a coordinate transformation. In 
Chapter 12 we will see that the notion of structural stability is based on 
such a classification scheme. 

Before considering Poincare maps, we want to discuss coordinate trans- 
formations, or, to use the more general mathematical term, conjugacies, 
giving some results that describe properties which must be retained by a 
map or vector field after a coordinate transformation of a specific differen- 
tiability class. Let us begin with an example which should be familiar to 
the reader. 

Example 11.0.1. We want to motivate how coordinate transformations affect 
the orbits of maps. 

Consider two linear, invertible maps 

X !->■ Ax, ® € R" (11.0.1) 

y I— >■ By, y € R". (11.0.2) 

For xo G R", we denote the orbit of xo under A by 

Oa{xo) = {•••, A~"xo, ■ ■ ■ , A~^xo,xo,Axo, ■■■ , A"xo, ■ ■ •}, (11.0.3) 

and, for yo G R", we denote the orbit of j/o under B by 
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Os(j/o) = ^yo,yo,Byo,--- (11.0.4) 

Now suppose A and B are related by a similarity transformation, i.e., there is an 
invertible matrix T such that 



B = TAT~^. 



(11.0.5) 



We could think of T as transforming A into B, and, hence, since it does no harm 
in the linear setting to confuse the map with the matrix that generates it, T 
transforms (11.0.1) into (11.0.2). We represent this in the following diagram 

K" ^ K" 




( 11 . 0 . 6 ) 



The question we want to answer is this: when (11.0.1) is transformed into (11.0.2) 
via (11.0.5), how are orbits of A related to orbits of B? To answer this question, 
note that from (11.0.5) we have 

B" = TA"T~^ for all n. (11.0.7) 

Hence, using (11.0.7) and comparing (11.0.1) and (11.0.5), we see that orbits of 
A are mapped to orbits of B under the transformation y — Tx. Moreover, we 
know that since similar matrices have the same eigenvalues, the stability types 
of these orbits coincide under the transformation T. 



End of Example 11.0.1 



Now we want to consider coordinate transformation in a more general, 
nonlinear setting. However, the reader will see that the essence of the ideas 
is contained in this example. 

Let us consider two C’’ diffeomorphisms /: K” — >■ K” and g:W^ — >■ K”, 
and a diffeomorphism /i:M" — >■ R". 

Definition 11.0.1 (Conjugacy) / and g are said to he conjugate 
(k < r) if there exists a diffeomorphism h: M" — >■ K” such that g oh = 
ho f. If k = 0, f and g are said to be topologically conjugate. 



The conjugacy of two diffeomorphisms is often represented by the fol- 
lowing diagram. 



h 





( 11 . 0 . 8 ) 



The diagram is said to commute if the relation goh = ho f holds, meaning 
that you can start at a point in the upper left-hand corner of the diagram 
and reach the same point in the lower right-hand corner of the diagram by 
either of the two possible routes. We note that h need not be defined on all 
of M" but possibly only locally about a given point. In such cases, / and g 
are said to be locally conjugate. 

If / and g are conjugate, then we have the following results. 
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Proposition 11.0.2 If f and g are conjugate, then orbits of f map to 
orbits of g under h. 

Proof: Let xq G M"; then the orbit of xq under / is given by 

O(a;o) = {•••, f~"‘{xo),- ■ ■ , f~'^{xo),xo, f{xo), ■■■, /"(xq), • • •}. (11.0.9) 

From Definition 11.0.1, we have that / = h~^ o g o h, so for a given n > 0 
we have 

/"(xq) = {h~^ o goh)o {h~^ o g o h) o ■ ■ ■ o o go h){xo) 

'' V ' 

n factors 

= h~^ o g'^ o h{xo) (11.0.10) 

or 

ho P(xo) = g"^ oh{xo). (11.0.11) 

Also from Definition 1.2.2, we have that f~^ = h~^ o g~^ o h, so by the 
same argument, for n > 0 we obtain 

o /~”(xo) = o /i(xo). (11.0.12) 

Therefore, from (11.0.10) and (11.0.12) we see that the orbit of xq under / 
is mapped by h to the orbit of ft-(xo) under g. □ 

Proposition 11.0.3 If f and g are conjugate, k> 1, and Xq is a fixed 
point of f, then the eigenvalues of Df{xo) are equal to the eigenvalues of 
Dg{h{xo)). 

Proof: From Definition 11.0.1, f{x) = h~^ ogoh{x). Note that since xq is a 
fixed point then h~^ o goh{xo) = xq. Also, by the inverse function theorem, 
we have Dh~^ = {Dh)~^. Using this and the fact that h is differentiable, 
we have 

Df\ =Dh~^ DgL, ,Dh\ . (11.0.13) 

Therefore, recalling that similar matrices have equal eigenvalues gives the 
result. □ 



Now we will return to the specific question of what happens to the 
Poincare map when the cross-section is changed. We begin with Case 1, 
a Poincare map defined near a periodic orbit. 
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11.1 Case 1: Poincare Map Near a Periodic Orbit: 
Variation of the Cross-Section 

Let xq and x\ be two points on the periodic solution of (10.1.1), and let 
So and Si be two (n — l)-dimensional surfaces at xq and xi, respectively, 
which are transverse to the vector field, and suppose that Si is chosen 
such that it is the image of Sq under flow generated by (10.1.1); see Figure 
11.1.1. By Theorem 7.1.1, this defines a C*" diffeomorphism 



/i:So^Si. (11.1.1) 

We define Poincare maps Pg and Pi as in the previous construction. 



Pq- ho — f So, 




Xg H> (I>{t{xo),Xo), 


Xg &Vg C. So 


Pi'- hi Si, 




Xi H> (1>{t{xi),Xi), 


Xi GVi C Si 



( 11 . 1 . 2 ) 

(11.1.3) 




FIGURE 11.1.1. The cross-sections Eq and Ei. 
Then we have the following result. 

Proposition 11.1.1 Pg and Pi are locally C’’ conjugate. 
Proof: We need to show that 



Pi o h = ho Pq, 

from which the result follows immediately since h is a, C’’ diffeomorphism. 
However, we need to worry a bit about the domains of the maps. We have 

h{^o) = Si, 

PoiVg) C So, 

Pi(l^i)cSi. 



(11.1.4) 




11.2 Variation of the Cross-Section of the Poincare Map: Case 2 
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Thus, ft- o Pq: Vb — 1 Si is well defined but Pi o ft need not be defined, since 
Pi is not defined on all of Si; however, this problem is solved if we choose 
Si such that Vi = ft(Vb) and take Vq sufficiently small. □ 



11.2 Case 2: The Poincare Map of a Time-Periodic 
Ordinary Differential Equation: Variation of 
the Cross-Section 



Consider the Poincare map Pg^ defined on the cross-section S®“ as defined 
in (10.2.5). Suppose we construct a different Poincare map, Pg_^, in the same 
manner but on the cross-section 



S®i = { (x, ft) G K” X 5*0 ft = fti G (0, 27 t] } . 



( 11 . 2 . 1 ) 



Then we have the following result. 

Proposition 11.2.1 Pg^ and Pg^ are C’’ conjugate. 

Proof: The proof follows a construction similar to that given in Proposition 
11.1.1. We construct a O'" diffeomorphism, ft, of into by mapping 
points on into under the action of the flow generated by (10.2.3). 

Points starting on have initial time to = (fto — do) /to, and they reach 

after time 



thus we have 

ft: ^ , 

9q - ftp 

OJ 



'o X 



h-ftp 

OJ 



,ftl 



( 11 . 2 . 2 ) 



Using (11.2.2) and the expression for the Poincare maps defined on the 
different cross-sections, we obtain 



hoPg^: 



ftp — ftp 



OJ 



ftp pE 



fti — ftp T 27 t 



OJ 



and 



Pg^ o ft: ^ , 

ftp — ftp 



OJ 



,(7p 



Of) ] 1-^ 



0i — ^0 “ 1 “ 27 t 



LO 



+ 27 t = 9i 

(11.2.3) 



9i -|- 27T = C?! 

(11.2.4) 
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Thus, from (11.2.3) and (11.2.4), we have that 

hoPg^=Pg^oh. □ (11.2.5) 

Therefore, Propositions 11.0.2 and 11.0.3 imply that, as long as we re- 
main sufficiently close to the periodic orbit, changing the cross-section does 
not have any dynamical effect in the sense that we will still have the same 
orbits with the same stability type. However, geometrically there may be 
an apparent difference in the sense that the locations of the orbits as well 
as their stable and unstable manifolds may “move around” under a change 
in cross-section. It may also be possible that an intelligent choice of the 
cross-section could result in a “more symmetric” Poincare map which could 
facilitate the analysis. We will see an example of this later. 

We note that the case of a Poincare map near a homoclinic orbit can be 
treated in the same way with the same results. We leave this as an exercise 
for the reader. 

We remark that it should be clear from these results that a Poincare 
map constructed according to Case 2 (i.e., the global cross-section) has 
information concerning all possible dynamics of the vector field. When only 
a local cross-section can be constructed, then the Poincare map will not, 
in general, contain information on all possible dynamics of the vector field. 
Different Poincare maps defined on different cross-sections may not have 
the same dynamics. 
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Structural Stability, Genericity, 
and Transversality 



The mathematical models we devise to make sense of the world around 
and within us can only be approximations. Therefore, it seems reasonable 
that if they are to accurately reflect reality, the models themselves must be 
somewhat insensitive to perturbations and have properties that are “not 
atypical” , in a sense that is not easy to characterize in a way that is useful in 
applications. The attempts to give mathematical substance to these rather 
vague ideas have led to the concept of structural stability and genericity, 
which have played an important role historically in the development of 
dynamical systems theory as a mathematical subject. Another concept that 
we define in this chapter is transversality. This is an example of a “typical 
property” arising in a number of settings that is amenable to concrete 
calculations. We will see that it will play an important role in local and 
global bifurcation theory, as well as in characterizing chaotic dynamics. 

Before defining structural stability and genericity, let us consider a spe- 
cific example which illustrates many of the issues that need to be addressed. 

Example 12.0.1. Consider the simple harmonic oscillator 

2 ( 12 . 0 . 1 ) 
y = -LOoX, 

We know everything about this system. It has a nonhyperbolic fixed point of 
{x, y) = (0, 0) surrounded by a one-parameter family of periodic orbits, each hav- 
ing frequency uiq. The phase portrait of (12.0.1) is shown in Figure 12.0.1 (note: 
strictly speaking, the phase curves are circles for uiq = 1 and ellipses otherwise). 
Is (12.0.1) stable with respect to perturbations (note: this is a new concept of sta- 
bility, as opposed to the idea of stability of specific solutions discussed in Chapter 
1)? Let us try a few perturbations and see what happens. 



Linear, Dissipative Perturbation 
Consider the perturbed system 



x = y, 

y = -ulx - ey. 



( 12 . 0 . 2 ) 



It is easy to see that the origin is a hyperbolic fixed point, a sink for e > 0 and 
a source for e < 0. However, all the periodic orbits are destroyed (use Bendixson’s 
criteria). Thus, this perturbation radically alters the structure of the phase space 
of (12.0.1); see Figure 12.0.2. 
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FIGURE 12.0.1. 




e>o 




e<o 

FIGURE 12.0.2. 



Nonlinear Perturbation 
Gonsider the perturbed system 

x = y, 

y = —uiqX + ex^ . 

The perturbed system now has two fixed points given by 

(x,y) = (0,0), 

(x,y) = (wo/e,0). 



(12.0.3) 



(12.0.4) 
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The origin is still a center (i.e., unchanged by the perturbation), and the new 
fixed point is a saddle and far away for e small. 

This particular perturbation has the property of preserving a first integral. In 
particular, (12.0.3) has a first integral given by 

= y + ^ (12.0.5) 




e>o 




FIGURE 12.0.3. 

This enables us to draw all phase curves for (12.0.3), which are shown in Figure 
12.0.2. From Figure 12.0.3, we make the following observations. 

1. This particular perturbation preserves the symmetry of (12.0.3) implied by 
the existence of a first integral. Therefore, sufficiently close to (x, y) = (0, 0) 
the phase portraits of (12.0.1) and (12.0.3) look the same. However, for 
(12.0.3), it is important to note that the frequency of the periodic orbits 
changes with distance from the origin, as opposed to (12.0.1). 

2. The phase space of (12.0.1) is unbounded. Therefore, no matter how small 
we take e, far enough away from the origin the perturbation is no longer a 
small perturbation. This is evidenced in Figure 12.0.3 by the saddle point 
and the homoclinic orbit connecting it to itself. Thus, there is a problem 
in discusing perturbations of vector fields on unbounded phase spaces. 
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Time- Dependent Perturbation 



Consider the system 



x = y, 

y = —u>oX + ex cost. 



( 12 . 0 . 6 ) 



This perturbation is of a very different character than the previous two. Writing 
(12.0.6) as an autonomous system (see Chapter 7) 



x = y, 

y=-uJoX-\-£xcos6, (12.0.7) 

0 = 1 , 

we see that the time-dependent perturbation has the effect of enlarging the di- 
mension of the system. However, in any case, {x, y) = (0, 0) is still a fixed point 
of (12.0.6), although it is interpreted as a periodic orbit of (12.0.7). We now ask 
what the nature of the flow is near (x,y) = (0,0), which is a difficult question 
to answer due to the time dependence. Equation (12.0.6) is known as the Math- 
ieu equation, and for wo = ri/2, n an integer, it is possible for the system to 
exhibit parametric resonance resulting in a solution starting near the origin that 
grows without bound. Thus, the flow of (12.0.7) near the origin differs very much 
from the flow of (12.0.1) near the origin. For more information on the Mathieu 
equation see Nayfeh and Mook [1979]. 



End of Example 12.0.1 

This simple example illustrates several points that need to be considered 
when discussing whether or not a system is stable under perturbations. 

Specification of the Space of Dynamical Systems: It is important 
to specify the type of perturbations that are allowed. For example, 
if the system has a symmetry, then one might want to consider only 
perturbations which preserve the symmetry. The idea of structural 
stability thus depends on the type of dynamical system under con- 
sideration. 

Quantifying “Closeness” of Dynamical Systems: In discussing the 
idea of a perturbation of a dynamical system, it is necessary to spec- 
ify what it means for two vector fields or maps to be “close.” In our 
example we used an e and required e to be small. However, we saw 
that this did not work well when the phase space was unbounded. 

Quantifying Qualitatively Similar Dynamics: It is necessary to quan- 
tify the statement “two dynamical systems have qualitatively the 
same dynamics.” This must be specified if one is to decide when a 
system is structurally stable. 

Up to this point our discussion has been very heuristic. Indeed, our main 
purpose has been to get the reader to worry about whether the systems they 
are studying are stable under perturbations. We will see throughout this 
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book, especially when we study bifurcation theory, that a consideration of 
this question often reveals much about the underlying dynamics of dynam- 
ical systems. However, now we want to say a little about the mathematical 
formulation of the notion of structural stability. 



12.1 Definitions of Structural Stability and 
Genericity 

The concept of structural stability was introduced by Andronov and Pon- 
tryagin [1937] and has played a central role in the development of dynami- 
cal systems theory. Roughly speaking, a dynamical system (vector field or 
map) is said to be structurally stable if nearby systems have qualitatively 
the same dynamics. Therefore, in defining structural stability one must 
provide a recipe for determining when two systems are “close,” and then 
one must specify what is meant by saying that, qualitatively, two systems 
have the same dynamics. We will discuss each question separately. 

Let C”(K",K") denote the space of C” maps of K" into K”. In terms 
of dynamical systems, we can think of the elements of C”(R",R") as be- 
ing vector fields. We denote the subset of C’’(R”,M”) consisting of the C” 
diffeomorphisms by DifF”(R”,lR”). We remark that if one is studying dy- 
namical systems that have certain symmetries, then additional constraints 
must be put on these spaces. 

Two elements of C”(K",]R") are said to be C” £-close {k < r), or just 
close, if they, along with their first k derivatives, are within £ as measured 
in some norm. There is a problem with this definition; namely, R” is un- 
bounded, and the behavior at infinity needs to be brought under control. 
The reader should consider this in the context of the example described 
at the beginning of this chapter. This explains why most of the mathe- 
matical theory of dynamical systems has been developed using compact 
phase spaces; however, in applications this is not sufficient and appropriate 
modifications must be made. 

There are several ways of handling this difficulty. For the purpose of our 
discussion we will choose the usual way and assume that our maps act on 
compact, boundary less n-dimensional differentiable manifolds, M , rather 
than all of R". The topology induced on C^{M,M) by this measure of 
distance between two elements of C^{M,M) is called the topology, and 
we refer the reader to Palis and de Melo [1982] or Hirsch [1976] for a more 
thorough discussion. 

The question of what is meant by saying that two dynamical systems are 
close is usually answered in terms of conjugacies. Specifically, C° conjugate 
maps have qualitatively the same orbit structure in the sense of the propo- 
sitions given in Chapter 11. For vector fields there is a similar notion to 
conjugacies for maps called a equivalence. We will discuss this in more 
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detail in Chapter 20 when we study bifurcation theory (note: in some sense 
the study of bifurcation theory will be the study of structural instability) . 
In this section we will state the definitions for maps along with vector fields; 
the reader should refer back to these definitions when we study the related 
ideas for vector fields. 

We are now at the point where we can formally define structural stability. 

Definition 12.1.1 (Structural Stability) Consider a map f G Diff’^ 
(M,M) (resp. a C’’ vector field in C'"{M,M)); then f is said to be struc- 
turally stable if there exists a neighborhood Af of f in the topology such 
that f is C° conjugate (resp. C° equivalent) to every map (resp. vector 
field) in M . 

Now that we have defined structural stability, it would be nice if we 
could determine the characteristics of a specific system which result in 
that system being structurally stable. From the point of view of the applied 
scientist, this would be useful, since one might presume that a dynamical 
system used to model phenomena occurring in nature should possess the 
property of structural stability. Unfortunately, such a characterization does 
not exist, although some partial results are known, which we will describe 
shortly. One approach to the characterization of structural stability has 
been through the identification of typical or generic properties of dynamical 
systems, and we now discuss this idea. 

Naively, one might expect a typical or generic property of a dynamical 
system to be one that is common to a dense set of dynamical systems in 
C’’(M, M). This is not quite adequate, since it is possible for a set and its 
complement to both be dense. For example, the set of rational numbers 
is dense in the real line, and so is its complement, the set of irrational 
numbers. However, there are many more irrational numbers than rational 
numbers, and one might expect the irrationals to be more typical than the 
rationals in some sense. The proper topological sense in which this is true 
is captured by the idea of a residual set. 

Definition 12.1.2 (Residual Set) Let X be a topological space, and let 
U be a subset of X. U is called a residual set if it contains the intersection 
of a countable number of sets, each of which are open and dense in X. If 
every residual set in X is itself dense in X , then X is called a Baire space. 

We remark that C^{M,M) equipped with the C’’ topology {k < r) is a 
Baire space (see Palis and deMelo [1982]). We now give the definition of a 
generic property. 

Definition 12.1.3 (Generic Property) A property of a map (resp. vec- 
tor field) is said to be generic if the set of maps (resp. vector fields) 
possessing that property contains a residual subset in the topology. 
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Residual sets have played the central role in characterizing genericity in 
the development of dynamical systems theory as a mathematical subject. 
Upon an initial consideration of the definition, one would expect that resid- 
ual sets capture “most” of the points in a space. However, the manner in 
which one mathematically captures the concept of “most”, is rather subtle. 
Residual sets are a topological notion. One might also consider a proba- 
bilistic or measure theoretic notion of “typical”. For example, on the unit 
interval the rational numbers have Lebesgue meaure zero and the irrational 
numbers have Lebesgue measure one (full measure). Such a characteriza- 
tion is not equivalent to the topological characterization since a residual set 
can have Lebesgue measure zero and an open and dense subset of K" can 
have arbitrarily small Lebesgue measure. Hence, certain generic properties 
could occur with zero probability. An excellent discussion of these ideas, 
as well as a development of these ideas in a measure-theoretic framework, 
can be found in Hunt et al. [1992]. 

Below we list some generic properties of dynamical systems (in the sense 
of the property holding for a residual subset of the appropriately defined 
space). 

Example 12.1.1 (Examples of Structurally Stable and Generic Properties). 

• Hyperbolic fixed points and periodic orbits are structurally stable and 
generic. 

• The transversal intersection (see Section 12.2) of the stable and unsta- 
ble manifolds of hyperbolic fixed points and periodic orbits is structurally 
stable and generic. 



End of Example 12.1.1 

The proof of these statements in a carefully defined class of dynamical 
systems comprises the Kupka-Smale Theorem. Details of the proof of this 
theorem can be found in Palis and de Melo [1992]. 

In utilizing the idea of a generic property to characterize the structurally 
stable systems, one first identifies some generic property. Then, since a 
structurally stable system is C° conjugate (resp. equivalent for vector fields) 
to all nearby systems, structurally stable systems must have this property 
if the property is one that is preserved under conjugacy (resp. equiv- 
alence for vector fields). One would like to go the other way with this 
argument; namely, it would be nice to show that structurally stable sys- 
tems are generic. For two-dimensional vector fields on compact manifolds, 
we have the following result due to Peixoto [1962]. 

Theorem 12.1.4 (Peixoto’s Theorem) A C” vector field on a compact 
houndaryless two-dimensional manifold M is structurally stable if and only 

if 




164 



12. Structural Stability, Genericity, and Transversality 



i) the number of fixed points and periodic orbits is finite and each is 
hyperbolic; 

ii) there are no orbits connecting saddle points; 

iii) the nonwandering set consists of fixed points and periodic orbits. 

Moreover, if M is orientable, then the set of such vector fields is open and 
dense in C'"{M,M) (note: this is stronger than generic). 

This theorem is useful because it spells out precise conditions under 
which the dynamics of a vector field on a compact boundaryless two mani- 
fold are structurally stable. Unfortunately, we do not have a similar theorem 
in higher dimensions. This is in part due to the presence of complicated 
recurrent motions (e.g., the Smale horseshoe; see Chapter 23) which are 
not possible for two-dimensional vector fields. Even more disappointing is 
the fact that structural stability is not a generic property for n-dimensional 
diffeomorphisms (n > 2) or n-dimensional vector fields (n > 3). This fact 
was first demonstrated by Smale [1966]. 

At this point we will conclude our brief discussion of the ideas of struc- 
tural stability and genericity. For more information, we refer the reader 
to Chillingworth [1976], Hirsch [1976], Arnold [1983], Nitecki [1971], Smale 
[1967], and Shub [1987]. However, before ending this section, we want to 
comment on the relevance of these ideas to the applied scientist, i.e., some- 
one who must discover what types of dynamics are present in a specific 
dynamical system. 

Genericity and structural stability as defined above have been guiding 
forces behind much of the development of dynamical systems theory. The 
approach often taken has been to postulate some “reasonable” form of dy- 
namics for a certain class of dynamical systems and then to prove that this 
form of dynamics is structurally stable and/or generic within this class. If 
one is persistent with this approach one is occasionally successful and even- 
tually a significant catalogue of generic and structurally stable dynamical 
properties is obtained. This catalogue is useful to the applied scientist in 
that it gives some idea of what dynamics to expect in a specific dynami- 
cal system. However, this is hardly adequate. Given a specific dynamical 
system, is it structurally stable and/or generic? 

We would like to give computable conditions under which a specific dy- 
namical system is structurally stable and/or generic. For certain special 
types of motions such as periodic orbits and fixed points, this can be done 
in terms of the eigenvalues of the linearized system. However, for more gen- 
eral, global motions such as homoclinic orbits and quasiperiodic orbits, this 
cannot be done so easily, since the nearby orbit structure may be exceed- 
ingly complicated and defy any local description. What this boils down to is 
that to determine whether or not a specific dynamical system is structurally 
stable, one needs a fairly complete understanding of its orbit structure, or 
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to put it more cynically, one needs to know the answer before asking the 
question. It might therefore seem that these ideas are of little use to the 
applied scientist; however, this is not exactly true, since the theorems de- 
scribing structural stability and generic properties do give one a good idea 
of what to expect, although they cannot tell what is precisely happening in 
a specific system. Also, the reader should always ask him or herself whether 
or not the dynamics are stable and/or typical in some sense. Probably the 
best way of mathematically quantifying these two notions for the applied 
scientist has yet to be determined. 



12.2 Transversality 

Before leaving this section let us introduce the idea of transversality, which 
will play a central role in many of our geometrical arguments. 

Transversality is a geometric notion which deals with the intersection of 
surfaces or manifolds. Let M and N be differentiable (at least C^) manifolds 
in M". 

Definition 12.2.1 (Transversality) Let p be a point mM"; then M and 
N are said to be transversal at p if p ^ M C\ N ; or, if p G M C\ N, then 
TpM + TpN = K”, where TpM and TpN denote the tangent spaces of M 
and N, respectively, at the point p. M and N are said to be transversal if 
they are transversal at every point p G M"; see Figure 12.2.1. 



Whether or not the intersection is transversal can be determined by 
knowing the dimension of the intersection of M and N . This can be seen 
as follows. Using the formula for the dimension of the intersection of two 





FIGURE 12.2.1. M and N transversal at p. 
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vector subspaces we have 

dim(TpM + TpN) = dimTpM + dimTpiV - dim(TpM n TpN). (12.2.1) 

From Definition 12.2.1, if M and N intersect transversely at p, then we 
have 

n = dimTpM + dimTpA^ — dim(TpM n TpN). (12.2.2) 

Since the dimensions of M and N are known, then knowing the dimension 
of their intersection allows us to determine whether or not the intersection 
is transversal. 

Note that transversality of two manifolds at a point requires more than 
just the two manifolds geometrically piercing each other at the point. Con- 
sider the following example. 

Example 12.2.1. Let M be the x axis in and let N be the graph of the 
function f{x) = x^\ see Figure 12.2.2. Then M and N intersect at the origin in 
but they are not transversal at the origin, since the tangent space of M is 
just the X axis and the tangent space of N is the span of the vector (1,0); thus, 
T(o,o)N = T(^o,o)M and, therefore, T(o,o)N + T(^o,o)M yf R^. 



End of Example 12.2.1 



y 




FIGURE 12.2.2. Nontransversal manifolds. 

The most important characteristic of transversality is that it persists 
under sufficiently small perturbations. This fact will play a useful role in 
many of our geometric arguments; we remark that a term often used syn- 
onymously for transversal is general position, i.e., two or more manifolds 
which are transversal are said to be in general position. 

Let us end this section by giving a few “dynamical” examples of transver- 
sality. 

Example 12.2.2. Gonsider a hyperbolic fixed point of a C”, r > 1, vector 
field on R". Suppose the matrix associated with the linearization of the vector 
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field about the fixed point has n — k eigenvalues with positive real part and 
k eigenvalues with negative real part. Thus this hxed point has an (n — k)- 
dimensional unstable manifold and a fc-dimensional stable manifold. If these two 
manifolds intersect in a point, other than the fixed point, then by uniqueness of 
solutions and invariance of the manifolds, they must intersect along a (at least) 
one-dimensional orbit. Hence, by (12.2.2), the intersection cannot be transverse. 



End of Example 12.2.2 

Example 12.2.3. Suppose the vector field of Example 12.2.2 is Hamiltonian 
so that all orbits are restricted to lie in (n — l)-dimensional “energy” surfaces 
given by the level sets of the Hamiltonian. Then it is possible for the stable and 
unstable manifolds of the hyperbolic fixed point to intersect transversely in the 
(n — 1)- dimensional energy surface. 



End of Example 12.2.3 

Example 12.2.4. Consider a hyperbolic periodic orbit of a C”, r > 1, vector 
field on K". Suppose that the Poincare map associated with the periodic orbit 
linearized about the hxed point has n — k — 1 eigenvalues with modulus greater 
than one and k eigenvalues with modulus less than one. Then the periodic orbit 
has an (n — fc)-dimensional unstable manifold and a (fc -|- l)-dimensional stable 
manifold. Therefore, by (12.2.2), if these manifolds intersect transversely, the 
dimension of the intersection must be one. This is possible without violating 
uniqueness of solutions and invariance of the manifolds. 



End of Example 12.2.4 



12.3 Exercises 



1. Consider the space of linear, autonomous vector fields on 

(a) Give a rigorous definition of this space. Define a norm on this space. 

(b) Describe the set of structurally stable vector fields on this space. Is this set a 
residual set? 

2. Consider the unit interval in R. Show that the irrational numbers are a residual subset. 
Show that the rational numbers are not a residual subset. 

3. Consider the unit interval in R. Construct an open and dense subset having Lebesgue 
measure smaller than any specified number. Construct a residual subset with Lebesgue 
measure zero. (Hint: for help, see Hunt et al. [1992].) 

4. Discuss the idea of structural stability of the following vector fields and maps. 

a) (61,02) G X S\ 

b) ^ = 2’ (a:,y)GR^. 

c) (..y)GR^ 
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d) 



X = y, 

V — X — x^ — y, 



(x,y) G 



e) 9 9 uj, 9 £ . 

f) 9 9 uj -\- e sin 9, e small, 9 ^ . 



5. Consider an autonomous vector field on having a hyperbolic sink equilibrium point 
and a hyperbolic saddle equilibrium point. 



(a) Can the unstable manifold of the saddle intersect the stable manifold of the sink 
transversely? 

(b) Can the stable manifold of the saddle intersect the stable manifold of the sink 
transversely? 



6. Consider an autonomous vector field on having a hyperbolic periodic orbit and a 
hyperbolic saddle equilibrium point having a one dimensional stable manifold and a 
two dimensional unstable manifold. 



(a) Can the unstable manifold of the saddle intersect the stable manifold of the 
periodic orbit transversely? 

(b) Can the stable manifold of the saddle intersect the unstable manifold of the 
periodic orbit transversely? 
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Lagrange’s Equations 



Up to this point in the book we have been studying general dynamical 
systems, e.g., , x = f{x), where f{x) is just C’’. In this, and the following 
four chapters, we will study dynamical systems with special structure] La- 
grangian systems, Hamiltonian systems, gradient vector fields, reversible 
systems, and asymptotically autonomous vector fields. We will see that the 
particular special structure greatly constrains the types of dynamics that 
are allowed, and it also provides techniques of analysis that are particular 
to the special structure. These particular types of structure are important 
because they do arise in a variety of applications. 

In this chapter we will derive and discuss the properties of Lagrange’s 
euations of motion or Lagrangian dynamical systems for a system of P parti- 
cles each having constant mass and located at position = {xp, yp, Zp), 
p = 1, . . . ,P, where (xp, yp, Zp) denote the standard cartesian coordinates. 

The issue of the correct choice of coordinates is central to the under- 
standing of many problems in dynamical systems theory. Indeed, many 
techniques, such as normal form theory and invariant manifold theory, are 
primarily concerned with finding a coordinate system in which the dynam- 
ical system assumes the ’’simplest” form, or the dimensionality (i.e., in this 
context, we mean the number of equations) of the system is reduced. 

For a system of P point masses moving under the influence of external 
and internal forces it may be that there are certain functional relations 
among some of the coordinate components. In this case we say that the 
motion of the point masses is subject to certain constraints. For example, 
a particle could be constrained to move on the surface of a sphere, on an 
inclined plane, etc. 

Suppose the system of P particles is such that C functional relations 
must be satisfied by the coordinates of the particles. We represent these 
constraints as follows: 



4>i{xi,yi,zi,...,xp,yp,zp,t), i=l,...,C. (13.0.1) 

Constraints that are represented as functions of the coordinates and time 
in this manner are referred to as holonomic constraints. In this case we say 
that the system has N = 3P—C degrees of freedom. That is, the position of 
all the particles in the system can be specified by choosing N independent 
coordinates. In other words, the number of degrees of freedom of a system 
is the number of independent coordinates needed to specify the positions 
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of all the components (in our case, particles) of the system. Choosing such 
a system of independent coordinates brings up the notion of generalized 
eoordinates, which we now discuss. 



13.1 Generalized Coordinates 

We introduce the notion of generalized coordinates by first considering an 
example. 

Example 13.1.1 (The Double Pendulum). 

The double pendulum consists of two mass points moving in the x — y plane. 
The mass point mi is located at a fixed distance, h, from a fixed point O, and 
m 2 is located a fixed distance I 2 from mi, see Fig. 13.1.1. 




FIGURE 13.1.1. Geometry of the double pendulum. 

The location of mi is given by the cartesian coordinates (xi,y\) and the loca- 
tion of m 2 is given by (* 2 , 1 / 2 ). However, one sees that four coordinates are not 
necessary for describing the location of all the particles in this system. In fact, 
the location of each point can be described by a single angle, as shown in Fig. 
13.1.1. This is because each mass point must remain a fixed distance from a given 
point. This results in us being able to describe the location of the mass point by 
the angle around the point from which it remains a fixed distance. Moreover, we 
can write down the relation between the coordinates (*i, j/i, * 2 , 1 / 2 ) and { 6 ^, 62 ) 
as follows: 

*i=bcos6li, yi=hs.m6i, 113 11') 

X2 = hcos6\ l2cos02, y2 = hsm0\ l 2 sm 62 - ^ '' 

In this example { 61 , 62 ) are generalized coordinates for the double pendulum. 



End of Example 13.1.1 
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A set of coordinates that contain the minimum number of independent 
coordinates needed to specify all the positions of a set of particles is re- 
ferred to as generalized coordinates, and denoted by {q\, . . . ,qN)- General- 
ized coordinates may be distances, angles, or quantities relating them. The 
integer N is referred to as the number of degrees-of-freedom of a system. 
So a system is said to have N degrees-of-freedom if the positions of all 
its components can be described by N independent (generalized) coordi- 
nates. The space of generalized coordinates of a system is often referred to 
as the configuration space of the system (the reader should contrast this 
with the notion of “phase space”). Furthermore, we refer to (gi, . . . , q^) as 
generalized velocities. 

Getting back to the notion of constraints, one could also say that the 
constraints imposed on a system are holonomic if it is possible to choose a 
set of independent generalized coordinates to describe the position of the 
set of particles. The issue of constraints is very important in dynamics, 
and is beyond the scope of this book. Excellent discussions of constraints, 
both holonomic and non-holonomic, can be found in Sommerfeld [1952], 
Whittaker [1904], and Arnold [1978] . One of the strengths of the Lagrangian 
form of dynamics is that it deals with constraints in a very simple manner, 
especially holonomic constraints. In this chapter we will deal exclusively 
with holonomic constraints. 

The relationship between the original cartesian position coordinates of 
each particle and the generalized coordinates (see, as an example, eq. 
(13.1.1)) is expressed in the following form: 

Xp — Xp(^q± , . . . , gjY , t) , 

Vp = yp(gi,...,giv,^), 

Zp = Zp{qi, • ■ • j Qn ^ i)i P — (13.1.2) 

or, in vector notation 

r^p = -rp{qi,---,qN,t), p=l,...,P. (13.1.3) 

In the derivation of Lagrange’s equations of motion two relationships 
between the derivatives of the coordinates and velocities and the corre- 
sponding generalized coordinates and velocities will be particularly useful, 
so we want to derive them now. 

Differentiating (13.1.3) with respect to time gives: 



dvp . 



dvp . 

b — 

oqN 



dVp 

dt 




(13.1.4) 



dip _ dr^ 
dqi dqi ' 



from which it follows that 



(13.1.5) 
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If we assume that the second order partial derivative of (13.1.3) exist 
and are continuous then the order of differentiation with respect to any 
two variables can be interchanged. In this case we have 



_ dip 

dt V dq^ ) dq^ ’ 



(13.1.6) 



which can be derived directly from (13.1.4) with a bit of work, which we 
leave for the reader. 



13.2 Derivation of Lagrange’s Equations 



Now we are ready to derive Lagrange’s equations of motion. Newton’s sec- 
ond law of motion for the point mass rup acted on by the force Fp is given 

by 

mpVp = Fp. (13.2.1) 

Here the force Fp acting on the particle p could be due to internal forces of 
interaction amongst the particles (such as electrical or gravitational interac- 
tions) or external forces. This distinction is not important in the derivation 
of Lagrange’s equations. 

Imagine each point p is subject to a displacement dvp. Then the work 
done on the system of particles is given by 

p p 

''^mpTp ■ dVp = ''^Fp ■ dVp. (13.2.2) 

p—1 p—1 



This is the key expression in our derivation of Lagrange’s equations (al- 
though there are other ways in which Lagrange’s equations may be derived, 
see Sommerfeld [1952]). Computing the differential of (13.1.3) gives: 



N 



dvp = 



dr. 



dqt + ^dt 1 . 



^ V ^q^ dt 
Substituting (13.2.3) into the left hand side of (13.2.2) gives: 



(13.2.3) 



^ ^ ^ dr 

Y mpVp -drp = YJ2 + -T^dt 

p—1 p—1 1 ^ 



P N 



p—1 i—1 



dr. 



-Wto (-- r •— ^ -f ■—^\dQ- 
- l^^P\ ^t\^P f)r,: I ^P rlt f)n: I 



dq^ 



d dr. 



dt dqi 



dv 

+mpicp ■ ~^dt using (13.1.5) and (13.1.6) 
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d_ 

dqi 







dqt 






^dt 
dt ’ 



^ ' dt \d(ji ) dqi ) 



dqt 



p 



^dt, (13.2.4) 



where 






m„r. 



2 ■■"pip •-pi 

p=i 



(13.2.5) 



is the kinetic energy of the system of P particles. 

Substituting (13.2.3) into the right hand side of (13.2.2) gives: 



p 

E 

p^l 



N P ^ P ^ 

E E Fp ■ + E Fp • 

p^i p^i 

^ ^ f) 

^4>*% + ^Fp- 



(13.2.6) 



where 

(13.2.7) 

P=1 

is referred to as the generalized force associated with the generalized coor- 
dinate qt- 

Since (13.2.6) and (13.2.4) are equal (from (13.2.2)), subtracting these 
two equations gives: 



N 






(13.2.8) 



Since we are considering only holonomic constraints the dqi are all inde- 
pendent displacements. Then we must have: 



^ ^ 

dt \dq^J dq. 



i=l,...,iV. 



These are Lagrange’s equations of motion. 

If the forces are derivable from a potential function, i.e.. 



^^ = 



dV 
dqi ’ 



(13.2.9) 



(13.2.10) 
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then we can write Lagrange’s equations in a more compact form. We define 
the Lagrangian function (or, just the “Lagrangian” ) as 

L = T-V. (13.2.11) 

Then, since V is only a function of the qi, we have 

dL _ dT 
d(ji d(ji ’ 

and Lagrange’s equations become: 



dt \d(ji ) 







(13.2.12) 



From (13.1.3), (13.1.4), (13.2.5), and (13.2.7) we see that the Lagrangian 
has the following functional dependencies with respect to the generalized 
coordinates: 

L = L{qi, ...,qN,qi,.. -,qN,t). (13.2.13) 

In some cases it may be convenient to adopt a shorthand notation for 
(13.2.12). Let 

q={qi,...,qN), g = (gi, . . . , gw). 

Then we write (13.2.12) as 



dt \dq ) 




(13.2.14) 



where (13.2.12) is the definition of (13.2.14). 

We now give an example where we derive the equations of motion of a 
system using Lagrange’s equations. 

Example 13.2.1 (The Pendulum). 

As an example, we compute the Lagrangian, and Lagrange’s equations of mo- 
tion, for the pendulum, as illustrated in Fig. 13.2.1. We assume that all forces 
acting on the pendulum are conservative (e.g., there is no drag force as the pen- 
dulum moves through air). 

We choose as the generalized coordinate (the reader should convince him or 
herself that this is a one degree-of-freedom system) the angle 6 between the 
vertical line OA and the string, OB, of the pendulum. If I denotes the length of 
OB then the kinetic energy is given by 

T = -mv^ = -m{l9)^ = 

2 2 ^ ^ 2 

Next we compute the potential energy, V, We take as a reference line the 
horizontal line through the lowest point of the pendulum, denoted A. Then the 
potential energy due to gravity is given by 



V = mg {OA — OC) = mg {I — I cos 6) = mgl{l — cos 9). 
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FIGURE 13.2.1. Geometry of the pendulum. 
Hence, the Lagrangian is given by 

L = T — V = — mgl{l — cos9). 

Recall that Lagrange’s equations of motion are given by: 

^ - n 

dt\d9) 99 



Therefore 



and 



— = -mglsm9, 



^ = ml^9. 
89 



Hence, Lagrange’s equations for the pendulum are given by: 

ml^9 + mgl sin0 = 0, 



0 + y sin6» = 0. 



End of Example 13.2.1 



13.2a The Kinetic Energy 

We now want to make a few remarks about the kinetic energy function in 
generalized coordinates. From (13.2.5), we see that the kinetic energy is 
a quadratic function when expressed in the original cartesian coordinates. 
We now want to express the kinetic energy in the (g, q) coordinates. This 
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is done by substituting (13.1.4) into (13.2.5), and then after some routine 
algebra we obtain: 

N N N 

j (?) d” n-i(( 7 , t)< 7 i + f{q,t), (13.2.15) 

i—l j = l i—l 




Note that if the transformation between the original coordinates and the 
generalized coordinates (i.e., (13.1.3)) is independent of time, then the last 
two sums in (13.2.15) do not occur. 



13.3 The Energy Integral 



In the case where the generalized forces are derivable from a potential 
function (so that Lagrange’s equations are given by (13.2.12)), and the La- 
grangian is independent of time, Lagrange’s equations possess and integral 
of the motion, called the energy integral, which is given by: 

^ r)T 

B = (13.3.1) 

i=l 

We prove this quantity is an integral for Lagrange’s equations through the 
following calculation: 




using (13.2.12) 



(13.3.2) 
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So if L does not depend on time then we have 




13.4 Momentum Integrals 



In the previous section we saw that when the Lagrangian is time indepen- 
dent there is an integral for the system, the energy integral. Similarly, we 
will now see that when the Lagrangian does not depend upon a certain 
coordinate, say qk (although it may depend on qk), then there is also an 
integral for the system, called a momentum integral. The coordinate qk is 
referred to as a cyclic or ignorahle coordinate. 

In particular, suppose 

L = L{qi , . . . ,gffc_i, gffc+i, . . ,,qN,qi, . . .,qN,t). 

Then clearly 



and therefore it 



dL 

dqk 



= 0 , 



follows from Lagrange’s equations that 



d /dL\ 
dt \dqk) 



= 0 . 



Hence 



dL ^ 

TTT- = Pk = constant. 
dqk 



In general the quantity ^ is referred to as the fcth component of mo- 
mentum (or generalized momentum), and is denoted by pk- It plays an 
important role in Hamilton’s equations, which we now discuss. 



13.5 Hamilton’s Equations 

Now we will show how Hamilton’s equations can be derived using the La- 
grangian. 

The function 

N 

H{q,p,t) = '^Ptdi - L{q,q,t), (13.5.1) 

i=l 

is defined to be the Hamiltonian function (or just “Hamiltonian”) of the 
system. Note that it is just the energy integral given in (13.3.1) where 



dL 

Pi = 



and p= {pi,...,pn). 



(13.5.2) 
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This is an important point because the left hand side of (13.5.1) is denoted 
as a function of q, p, and t, rather than q, q, and t. This is accomplished 
by using (13.5.2) to solve for g as a function of p, and then substituting the 
result for q into (13.5.1), resulting in a function depending on g, p, and t. 
Note that differentiating (13.5.2) with respect to time and using (13.1.5) 
gives 




(13.5.3) 



Computing the differential of (13.5.1) gives: 



dH{q,p,t) 



N 

'^d{piq^) - dL, 

N 

didpt + Pidqi - 
N 

didpt - Pidqi - 

i=l 



dL , dL dL , 

7^ dqi r, . ddi vrrdt, 

uqi odi ut 

QL 

— (it, using (13.5.2) and (13.5.3). 

(13.5.4) 



Since, as described above, the Hamiltonian is a function of g, p, and t, we 
have, in general: 



dH{q,p,t) 



N 



E 



OH 

dqi 



dqi + 



OH 

dpi 



, dH , 
dpi + -T^dt. 
ot 



(13.5.5) 



The expressions given in (13.5.4) and (13.5.5) must be equal. Since the g^ 
and Pi are independent (holonomic constraints), equating the coefficients 
of dqi and dpi immediately leads to Hamilton’s equations 



with the relation 



dH 



opi 






dH 




(13.5.6) 


Pi = 

OQi 


i = l,...,iV, 


dH 


dL 


(13.5.7) 


In ~ 


~1h' 



13.6 Cyclic Coordinates, Routh’s Equations, and 
Reduction of the Number of Equations 

Now we will describe an elegant procedure due to Routh for reducing the 
number of equations that need to be integrated in the case where we have 
cyclic coordinates in the Lagrangian. 
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Suppose we have r cyclic coordinates. Without loss of generality (through 
relabeling the coordinates, if necessary) we can suppose these are the first 
r coordinates: 

/ 



9i, 



■ 1 Qri Qr-\-l 5 ■ 






V cyclic 

Then the Lagrangian has the form 

L = L{qr+i,. . .,qN, Qi, ■ ■ - ,qN), 
and, as we described earlier. 



dL ^ 

77 ^ = Pi = constant. 






(13.6.1) 



(13.6.2) 



(13.6.3) 



Using (13.6.3) we can solve for qi, i = l,...,r, as a function of 
qr+i,...,qN,qr+i,---,qN,(3i,---,Pr,t, i.c., we have 

qi = qi{qr+i,---,qN,qr+i,---,qN,Pi,---,(3r,t) i=l,...,r. (13.6.4) 



We then form Routh’s function, or the Routhian, defined as follows: 

r 

R = L~Y,p,q,, (13.6.5) 

i=l 

Computing the differential of (13.6.5) gives: 



N 

dR= Y. 

N 

= E 



N 



dL dL . dL , • 7 \ 

— ~ \P^dqi + qtdpi) , 



dq; 

(K 

dqi 



i=l 

N 



dL \ ^ dL . dL \ ^ f ^ ^ r- r-\ 

E ~^dq^ + —dt-Yd^dp^. (13.6.6) 






dt 



i=l 



However, as we described above, in general the Routhian is a function of 
qr+i,. . . ,qN,qr+i, - ■ ■ ,qN,ldi,. ■ ■ where Pi = pi, i = I, . . . ,r. Com- 

puting the differential of the Routhian with these functional dependencies 
gives: 




y^dR ^ dR , 



i=l 



dt 



(13.6.7) 



Now (13.6.6) and (13.6.7) must be equal. Since the coordinates qi, qi and 
Pi are independent we can then equate the coefficients on dqi, dqi and dpi 
to obtain Routh’s equations: 

dL _ dR 
dqi dqi ’ 
dL _ dR 
dqi dqi ’ 



i = r -I- 1, . . . , A^, 



(13.6.8) 
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with the relation 



<it = - 



(m 

dpi' 






Substituting (13.6.8) into Lagrange’s equations (13.2.14) gives: 



(13.6.9) 



d fdR\ dR 
dt ) dqi 



i = r + 1, . . . ,N. 



(13.6.10) 



Hence we solve the N — r equations for qr+i, . . . ,qN, dr+i, ■ ■ ■ We then 
substitute the result into (13.6.9) to obtain qi, i = 1, . . . ,r. In turn, these 
can then be integrated (since, now in principle they are known functions of 
time once (13.6.10) are solved) to give gi, i = 1, . . . , r. So we see that in the 
case where there are r cyclic coordinates the solution of Lagrange’s equa- 
tions can be “reduced” to the solution of fV — r (second order) differential 
equations. 



13.7 Variational Methods 

Trajectories of Lagrange’s and Hamilton’s equations can also be character- 
ized as extrema of a certain functional defined on paths in configuration 
and phase space, respectively. This leads to the variational principles of me- 
chanics, which we now describe. Excellent references with historical surveys 
can be found in Sommerfeld [1952] and Arnold [1978]^. 



13.7a The Principle oe Least Action 

Let C denote the set of curves in the configuration space with fixed end 
points, i.e., a “point” in C is a map 



t {qi{t), qN{t)) = q{t), to < t < ti, 

and the fixed endpoint condition means that every q{t) € C satisfies q{to) = 
go and q{t\) = q\. We equip C with the following norm. For q € C, the norm 
of g is defined as 



||g||= sup (|g*(f)| -k |g*(t)|) . (13.7.1) 

t6[to,ii], l<i<N 



We will need the curves in C to have at least one continuous derivative. 



^In most dynamics textbooks the action functional is stated and then it is 
shown that the extrema correspond to solutions of Lagrange’s equations. Som- 
merfeld [1952] provides an insightful derivation of the action functional by be- 
ginning from the principle of virtual work (a differential variational principle, as 
opposed to the integral variational principles discussed here). See also Arnold 
[1978] for a similar argument in more modern mathematical language. 
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If L{q,q,t) denotes the Lagrangian of a system then the expression de- 
fined by 

rti pti 

^( 9 )=/ L{q,q,t)dt= L{qi,...,qN,qi,...,qN,t)dt, (13.7.2) 

J tQ J tQ 

is called the action. The action is an example of a functional, i.e., a map 
from C to K. Now we want to define the notion of differentiability of this 
functional. 

Definition 13.7.1 The functional ^{q) is said to be differentiable at q if 
^{q + h) — d>(< 7 ) = F + R, where F = F{q,h) is a linear function of h (with 
hftg) = h{ti) = Q in order to satisfy the fixed end point constraint, and 
regarding q as fixed), and R= 0{\\ h |p). The term linear in h, F{q, h), is 
called the differential o/d*. 

We now define the notion of an extremal for the action. 



Definition 13.7.2 An extremal of the action ^ is a curve q such that 
F{q, h) = 0 for all h. 

Now we compute the differential of the action functional: 



J (^L{q + h,q + h,t)dt - L{q,q,t)^ dt, 



fdL d dL 



'to 



N 



dL. 



E / uij a uj-j \ uj-j 



7=1 



dcir 



■oiWhf), 



after integration by parts. 



(13.7.3) 



Now since h{to) 
given by 



h{ti) = 0 we see that the differential of the action is 




dt dq J 



hidt. 



(13.7.4) 



Now we set the differential of the action to zero and we can then appeal to 
a classical result from the calculus of variations (see, e.g., Arnold [1978]) 
that says that (13.7.4) is zero for all hi if and only if 



dL d dL 

dqi dt dq 



i=l,...N. 
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Now these are just Lagrange’s equations of motion, and we arrive at the 
following result, which is referred to as Hamilton’s principle of least action^. 

Theorem 13.7.3 (Hamilton’s Principle of Least Action) The curve 
to < t < ti is a solution of Lagrange’s equations: 



dt \dqi) 




i=l,...,N, 



if and only if it is an extremal of the action: 



^( 9 ) = [ L{q,q,t)dt. 
Jta 



Despite the name “principle of least action” we remark that an extremal 
of the action need not be a minimum. The only requirement is that the 
differential of the action must vanish on a trajectory. 

In the classical mechanics literature there is another common notation 
for the differential of the action. Namely, 

S f L{q,q,t)dt, (13.7.5) 

dto 

and the phrase “first variation’ or just “variation” of the action is used. 



13.7b The Action Principle in Phase Space 

The action principle can also be formulated in phase space. 

Let P denote the set of curves in the phase space where the q coordinates 
have fixed end points, i.e., a “point” in P is a map 



t {qi{t), . . . ,qN{t),pi{t),. . . ,PN{t)) = {q{t),p{t)), to < t < h, 

and the fixed endpoint condition means that for every curve (q(t),p(t)), 
the qft) component satisfies q{to) = qo and q{ti) = qi. We equip P with 
the following norm. For (q{t),p{t)) G P, the norm of (q{t),p{t)) is defined 
as 



II 11= sup {\q^{t)\ + \qi{t)\,\p^{t)\ + \p^{t)\) . (13.7.6) 

te[to,ti], l<i<N 

We will need the curves in P to have at least one continuous derivative. 



^While the name of Hamilton is associated with this integral variational prin- 
ciple in configuration space, Leibniz, Maupertnis, Euler, and Lagrange also made 
signihcant contributions. See Sommerfeld [1952]. 
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Since L = PiQi — H we can rewrite (13.7.2) and define the following 
functional on V: 

/ ^ (^^pidqi- H{q,p,t)d?j . (15.7.7) 



Next we compute the differential of this functional: 



/•‘i / ^ \ 

/ -Pidi - H(q + h,p+ k,t) + H(q,p,t) dt, 

•^‘o Vi=i / 

pti ^ riJ^ \ 

/ X! ( 11 ^)’ 

•^*0 j=i V opi ) 



rti 



N 



'*0 \i=i 



dH 



dH 






dqi 



N 



i^l 



+0(11 (h,k) f), 



(13.7.8) 



where the passage from the second to the third line in (13.7.8) is effected 
by integration by parts. Since h(to) = h(t\) = 0 we see that the differential 
of this functional is given by 




(13.7.9) 



Clearly, trajectories of Hamilton’s equations are extrema of (13.7.7). How- 
ever, the argument that extrema of (13.7.7) are trajectories of Hamilton’s 
equations requires a bit more care since pi and qi are related in time through 
the equation pi = Sommerfeld [1952] and Arnold [1978] both address 
this issue. Sommerfeld resolves the issue by noting that upon differentiat- 
ing (13.5.1) with respect to pi one obtains = qi. Hence the first term 
in braces in (13.7.9) vanishes. Since the hi are independent for each i, 
the second term in braces in (13.7.9) must also vanish (applying the same 
argument as was used to deduce Hamilton’s principle of least action in 
configuration space). Hence, we have the following result. ^ 

Theorem 13.7.4 (Hamilton’s Principle of Least Action in Phase 
Space) The curve (q(t),p(t)), to < t < ti is a solution of Hamilton’s 
equations: 



dH , dH 

= j^(<l,P,t), p^ = -j—(q,p,t), 
dp, dq. 



i = 1, 






^Poincare and Hilbert also studied this functional. 
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if and only if it is an extremal of the functional: 




Piq^ - H{q,p,t) dt. 



Rather than the phrase “Hamilton’s principle of least action” , often the 
shorter phrase “Hamilton’s principle” is used instead. This applies in both 
the configuration space and phase space formulations. 



13.7c Transformations that Preserve the Form of 
Hamilton’s Equations 

We now consider an application of the action principle in phase space. 
Consider the Hamiltonian 

H{q,p,t), (13.7.10) 

and the associated Hamilton’s equations: 

. dH ^ 
q= 

f) M 

p= - — {q,p,t). (13.7.11) 

Suppose we have a transformation of coordinates of the form 



Q = Q{q,p,t), 

P=P{q,p,t), (13.7.12) 

which we assume can be inverted (viewing t as fixed) to yield 

q = q{Q,P,t), 

p = p{Q,P,t). (13.7.13) 

Clearly, (13.7.13) can be substituted into (13.7.10) so that the Hamilto- 
nian in the original (q,p) coordinates can be written as a function of the 
{Q,P) coordinates: 

H{q,p,t)=H{Q,P,t). (13.7.14) 

However, it is not at all clear that this transformation preserves the form 
of Hamilton’s equations in the sense that 

Q=^{Q,P,t), 



(13.7.15) 
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Now we want to derive conditions on the transformation (13.7.12) so that 
(13.7.15) holds, and we will do this using Hamilton’s principle of least 
action in phase space. 

If (13.7.15) holds in the (Q,P) coordinates then we must have 

N 



and therefore 



5 / Pidqi — H dt = 6 / PidQi — Hdt = 0, (13.7.16) 

dto Jto 



or 



.ti N 

S / '^Pidqt - PidQi - {H - H)dt = 0, 

Jto 

from which it follows that 

N N 

dF{q, Q, t) = Y^pdF - Y + {H- H)dt, (13.7.17) 



for some function F{q,Q,t)- This function is referred to as the generat- 
ing function of the transformation (13.7.12). Writing out the terms of the 
differential of the left hand side of (13.7.17) gives 



OF dF dF 

Y + X! ^ ^ Yp^^^^ ~ X! + {H - H)dt, 

i —1 ^ i —1 * 2=1 i —1 



from which it follows that 



dF 

P^ = 

P- = -^ 

* dQP 
dF 



(13.7.18) 



So when F is known (13.7.18) gives the relation between the “old” coordi- 
nates (q,p), the “new” coordinates (P,Q), and the Hamiltonian. Note that 
if the system does not depend explicitly on time then the “new” Hamilto- 
nian function is the same as the “old” Hamiltonian function. Transforma- 
tions of coordinates given by (13.7.18) which preserve the form of Hamil- 
ton’s equations as described above are referred to as canonical transforma- 
tions or symplectic transformations (cf. Section 14.4). 



^The reader should be aware of how the terms and phrases “function” , “map” , 
“coordinate transformation” , and “operator” are often used interchangeably. 
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In some situations it may be more convenient to express the generating 
function not in terms of the old and new coordinates (i.e., in terms of q 
and Q), but rather in terms of the old coordinates q and the new momenta 
P. This can be accomplished by rewriting (13.7.17) as 

( N \ N N 

Pidqi + '^QidPi + {H - H)dt, (13.7.19) 

i=l / i=l i=l 

We express the argument of the differential on the left hand side of (13.7.19) 
as a function of q, P, t, which can be viewed as a new generating function 
G{q,P,t). Applying the same argument as above gives the relations 

dqi’ 
dG 

i7+^. (13.7.20) 

ot 

The reader should compare the discussion in this section with that in 
Section 14.4. 

13. 7d Applications of Variational Methods 

In recent years variational methods have been developed into a powerful 
and rigorous mathematical tool. Examples of some applications are given 
below. Kozlov [1985] provides an excellent survey of the application of 
variational methods in mechanics. 

Existence of Periodic Orbits. There is a vast literature on the use of 
variational methods for proving the existence of periodic orbits. See 
Rabinowitz [1978] and Struwe [1990] and the references therein. 

Existence of Homoclinic Orbits. In recent years variational methods 
have been developed for proving the existence of homoclinic orbits. 
See, e.g., Zelati et al. [1990] 

Existence of Invariant Tori. A proof of the KAM theorem (which is 
concerned with the existence of invariant tori in perturbations of in- 
tegrable Hamiltonian systems) by variational methods is given by 
Salamon and Zehnder [1989]. 

Existence of Heteroclinic Orbits. The existence of heteroclinic orbits 
are believed by many to be a key mechanism for global instabilities in 
Hamiltonian systems (such as “Arnold diffusion”). See, e.g., Mather 
[1993] and Bessi [1997]. 



Pr = 

Q^ = 
H = 
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Existence of Chaos. Sere [1993] has used variational methods to con- 
struct a chaotic invariant set modeled on the Bernoulli shift. 

Numerical Integration of Lagrange’s Equations. Hamilton’s princi- 
ple can be used to derive a class of numerical methods for solving 
Lagrange’s equations. An excellent background for this subject, as 
well as a literature review, can be found in Lewis and Kostelic [1996] 



13.8 The Hamilton- Jacobi Equation 



We now consider the action integral (or functional) defined in (13.7.2) from 
a different point of view. The action as defined in (13.7.2) is a functional 
defined on the set of curves in configuration space having the same end- 
points. We now want to view this integral as defined on extremals (i.e., on 
trajectories of Lagrange’s equations) and as a function of the endpoint of 
the extremal. In order to make this precise we will need to adopt a slightly 
different notation. 

Let q{t) denote an extremal of (13.7.2) with q{to) = qo and q{t) = q. 
Then we define ^ 

S{q,t)=[ L{q{T),q{T),T)dT. (13.8.1) 

Jto 

In this way we view the action as a function of the endpoint of an extremal 
of (13.7.2) (and we view qo and to as fixed). Arnold [1978] discusses a 
number of technical difficulties that can arise in this definition. 

Recalling (13.7.3), the differential of (13.8.1) (where variations in q, but 
not t are considered) is given by 



dS(q, t)h 





(13.8.2) 



Now since q{t) is an extremal the first term vanishes, and since q{to) = qo 
is fixed for each extremal under consideration, (13.8.2) reduces to 



^ dL ^ 

dS{q,t)h = ^ 

i—l ^ i—1 



(13.8.3) 



Since only q (and not t) was varied in computing the differential of S{q,t) 
we immediately obtain the relation 

dS 

Pi = j—, (13.8.4) 

dqi 

From (13.8.1) we have 

f = L, (13.8.5) 
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and therefore 



N 



dS _ dS . as 
~di ~ + 



i=l 



dqi 



at 



N 

= + 
i=l 



Ih' 



(13.8.6) 



Equation (13.8.6) can be rewritten as 

as 

— +H{q,p,t) = Q, (13.8.7) 

or 

f)Q / \ 

- + i7(,,-,t)=0, (13.8.8) 

Equation (13.8.8) is a partial differential equation for the function S{q,t) 
which is known as the Hamilton- Jacobi equation. From a solution of the 
Hamilton- Jacobi equation one can obtain the trajectories of the correspond- 
ing Hamilton’s canonical equations. We now describe this procedure, but 
first we need a definition. 



Definition 13.8.1 (Complete Integral of the Hamilton-Jacobi 
Equation) If 



(p{qi, ...,qN,t-,ai,...,aN) = (p{q, t] a), 



is a solution of (13.8.8), depending on the N constants a 
such that 







in the domain of interest, then 



(oi, . . .,aiq) 



S = 4> a, 



where a is a constant, is said to be a complete integral of (13.8.8). 

Now we prove the theorem that also provides a procedure for constructing 
the trajectories of Hamilton’s canonical equations from a complete integral 
of the Hamilton-Jacobi equation. 

Theorem 13.8.2 If a complete integral: 



S = 4>{qi, • ■ • , qN, t',ai,..., gn) + a, 
is known for the Hamilton-Jacobi equation (13.8.8): 



as 



at 



as 



-Ti7 + H[q,^,t] = 0 , 



aq ' 
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then from the equations 



, d(j) ■ 1 Ar 

Udi 



(13.8.9) 



with the 2N arbitrary constants at, bi, one obtains (implicitly) the 2N- 
parameter family of solutions of Hamilton’s equations: 



Pi = -- 






(13.8.10) 



Proof: The proof is taken from Courant and Hilbert [1962]. Since we are 
assuming that 

(li) ^ “• 

then we can solve the N equations 

n 

uCli 

for qi as a function of t and the 2N constants Oi, bi. If we substitute these 
functions into 

d(j) 

IT =Pi, 

oqt 

then we obtain pi as functions of t and the 2N constants a^, bi. 

Now we need to show that the solutions obtained in this way are solutions 
of Hamilton’s equations. 

We begin by differentiating the equation ^ = bi with respect to t to 
obtain: 

= n 1138111 



dtdoi ^ dqjdoi dt 

j=i ■> 



We next differentiate ^ + H{q, ^,t) = 0 with respect to Oi to obtain: 



dH d'^ej) ^ 
dtdoi dpj dqjdoi 



(13.8.12) 



Now we subtract (13.8.11) from (13.8.12), and use the fact that 
det yf 0 to obtain 

dH . ^ 

dj = j = l,...,iV. 

dPj 
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Next we need to show that the functions we obtained satisfy pi = — 

The argument is similar to the one just given. We first differentiate the 
equation ^ = pi with respect to t to obtain: 



dpi 

dt 



&^(j) 

dtdqi 



N 






d'^cj) dqj 

dqidqj dt 



(13.8.13) 



We next differentiate ^ + H{q, ^,t) = 0 with respect to qi to obtain: 



0 = 



d‘^4> 

dtdqi 



N 

E 

i=i 



dH d^(t) dH 

dpj dqjdqi dqi ’ 



(13.8.14) 



We have already shown that our functions satisfy qj = which we 
substitute into (13.8.14). Then we subtract (13.8.14) from (13.8.13) to im- 
mediately obtain: 

dH . , 

Pi = 

This completes the proof of the theorem. □ 



Example 13.8.1. Now we will give an application of the Hamilton-Jacobi 
method by solving for the trajectories of the simple harmonic oscillator. 

The Hamiltonian for the simple harmonic oscillator is given by: 

H = ^ + (13.8.15) 



Substituting p = ^ into (13.8.15), the Hamilton-Jacobi equation is found to be: 



^ 1 
dt 2 




1^22 n 

-f -cu g = 0. 



(13.8.16) 



Now we need to find a complete integral of the Hamilton-Jacobi equation. We 
assume a solution of the form: 



S{q,t) = Si{q) + S 2 {t). (13.8.17) 

Substituting (13.8.17) into (13.8.16) gives: 



dS2 

dt 



dSiV 

dq ) 



1 2 2 
+ 2^ ? . 



Now the left hand side of this equation is a function of t and the right hand side 
is a function of q. For equality to hold, each side must be equal to a constant, a: 



dS2 

dt 



dSiV 

dq ) 



loo 

+ g = a. 
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Then 

yields 

and 

yields 

Therefore 



1 fdSi\‘ 
dq J 



loo 

+ 2 ‘^ 1 = O’ 



Si = 



2 [a - ]dq, 



dS2 

S 2 = —at. 



S{q,t;a) = J ^ 2 ^ 



a — -a>^q^ ) dq — at. 



(13.8.18) 



With a solution of the Hamilton- Jacobi equation in hand, we can now follow 
the procedure described in Theorem 13.8.2 and obtain the solutions of Hamilton’s 
equations. 

We set 



da ^ ^ y/2 



y 



dq 



- t. 



a — 



(13.8.19) 



Computing the integral gives: 



1 . -1 
— sm 

U! 






— t 



q{t) — yy. sinai(t + b). 

LU 

The expression for p{t) is obtained through the relation 

dS 

The constants a and b are determined when the initial conditions for the trajec- 
tory are chosen®. 



End of Example 13.8.1 



®In deriving the equation for Si we took a square root, for which there is a 
choice of sign. The reader should check that we have not missed something here 
and, if not, why not? 
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13.8a Applications of the Hamilton-Jacobi 
Equation 

The main reason for developing the Hamilton-Jacobi equation is that it 
has plays a role in a large variety of applications nowadays. Accordingly, 
it is impossible to make more than a feeble attempt at a survey of the 
applications®. Nevertheless, we will provide a few pointers to the literature 
that are relevant to dynamical systems theory. 

Courant and Hilbert [1962] provides a mathematically gentle, and 
straightforward, introduction to the Hamilton-Jacobi partial differential 
equation. The books of Benton [1977] and Rund [1966] provide further 
references and the monographs of Lions [1982] and Barles [1994] provide 
a deeper look at the mathematics. Recent papers describing the asymp- 
totic behavior of solution of the Hamilton-Jacobi equation are Barles and 
Souganidis [2000a,b], and Roquejoffre [2001]. Recently there has been very 
interesting work involving generalizing the work of Aubry [1983a,b] and 
Mather [1982], [1984], [1986] on twist maps to other settings. See Fathi 
[1997], Evans and Gomes [2001], and Gomes [2001 a,b,c]. The Hamilton- 
Jacobi equation has been used to study the existence of homoclinic and 
heteroclinic orbits in near integrable systems. See Fathi [1998], Gallavotti 
et al. [2000], Rudnev and Wiggins [1999], [2000], and Sauzin [2001]. 



13.9 Exercises 



1. Give a set of generalized coordinates needed to completely specify the motion of each 
of the following systems: 

(a) a particle constrained to move on an ellipse, 

(b) a particle constrained to move on the surface of a sphere, 

(c) a pendulum that is not confined to a plane, i.e., it can move in three dimensional 
space. 



2. Derive (13.1.6) directly from (13.1.4). 

3. For the double pendulum described in Example 13.1.1: 

(a) compute the Lagrangian, 

(b) compute Lagrange’s equations of motion. 



4. 



Suppose that the transformation between the original coordinates and the generalized 
coordinates (i.e., (13.1.3)) is independent of time and let T denote the kinetic energy. 
Show that 



2T = J2q. 



dT 

Wi' 



5. Compute the energy integral (13.3.1) for the simple pendulum described in Example 
13.2.1. Show that it is constant on trajectories. 



®lndeed, typing “Hamilton Jacobi” into the MathSciNet search engine yielded 
842 references, Web of Science gave 1229, and Google gave 19,400. 
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6. Compute the energy integral (13.3.1) for the double pendulum described in Example 
13.1.1. Show that it is constant on trajectories. 

7. Consider the simple harmonic oscillator with Hamiltonian given by 




(a) Compute the Lagrangian and Lagrange’s equations. 

(b) Show that the transformation 



Q = 




sin Q, 



p — y/2Pu> cos Q , 



is canonical. 

(c) Show that in the Q — P coordinates Q is a cyclic coordinate. 

(d) Compute the Routhian and Routh’s equations. 



Consider a simple harmonic oscillator of mass m and spring constant k free to move 
in a plane. In plane polar coordinates the kinetic and potential energies are given by 

T ^ , 



y = -kr 
2 



Compute the Lagrangian and Lagrange’s equations of motion. Show that this system 
has two independent integrals. 

9. Compute the Hamiltonian and Hamilton’s equations of motion for the simple pendulum 
described in Example 13.2.1. 

10. Compute the Hamiltonian and Hamilton’s equations of motion for the double pendu- 
lum described in Example 13.1.1. 

11. Consider a simple harmonic oscillator of mass m and spring constant k free to move 
in a plane. In plane polar coordinates the kinetic and potential energies are given by 



T = Im , 

V = -kr^. 

2 



Compute the Routhian and Routh’s equations for this system. 

12. The Henon-Heiles Hamiltonian for a three dimensional potential is given by (Ferrer et 
al. [1998]): 



H = 



1 

2 



{pI +pI+pI) 



+ (^x'^ +V^+ 

(^{x^ + y^)z - -z^) , 



where e, u; > 0 are constants. 

(a) Compute the Lagrangian and Lagrange’s equations for this system. 

(b) The potential for this problem is said to be an axisymmetric potential. Con- 
struct a canonical transformation for which x — y are transformed to cylindrical 
coordinates, and the z coordinate is left alone. Verify that the coordinate trans- 
formation is canonical and write the Hamiltonian in these new coordinates. Show 
that one obtains a cyclic coordinate in these new coordinates. 
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(c) Using the cyclic coordinate obtained in the previous exercise, compute the 
Routhian and Routh’s equations. 

13. This exercise comes from Sommerfeld [1952]. Calculate the value of the action between 
the limits f = 0 and t — ti: 

(a) for the real motion of a falling partcle, z — 

(b) for two fictitious motions z — ct and z — at^, where the constants c and a 
must be so determined that the initial and end conditions coincide with those of 
the real path, in agreement with the rules for variations of paths in the action 
principle. Show that the integral has a smaller value for the real motion than 
for the fictitious motions. 

14. Show that the following transformations are canonical: 

(a) Q = p, P = -q, 

(b) Q — q tanp, P — log (sin p). 

15. Show that the transformation 



qi — r cos X) Pi — P cos X ~ ^ sin 



q2 — r sin x> P2 — P sin X + ^ cos ( — ) , 



is canonical. 

16. This exercise requires some knowledge of the method of characteristics from the theory 
of partial differential equations (see, e.g., Courant and Hilbert [1962]). Consider the 
Hamilton-Jacobi equation: 



dS 

dt 







= 0 . 



Show that the characteristics for this partial differential equation are given by the 
trajectories of Hamilton’s canonical equations: 



9 = 



dH 

dp 



p = - 



dH 



17. Consider a particle of mass m moving in the x — y plane under the influence of a 
central force that depends only on the distance from the origin. Use polar coordinates 
to describe the configuration space and let U(r) denote the potential due to the central 
force. 

(a) 

(b) 

(c) 



Compute the Lagrangian and Lagrange’s equations. 

Compute the Hamiltonian and Hamilton’s equations. 

Suppose that the central force has the form of an inverse square force, i.e., 



for some constant K. Use the Hamilton-Jacobi method to solve for the trajec- 
tories of Hamilton’s equations. 
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18. Show that if the function H in the Hamilton-Jacobi equation is independent of time 
then the solution S has the form 

S{q) = So{q) - Et. 

Furthermore, show that the corresponding Hamilton-Jacobi equation has the form 



where is a constant representing the total energy of the system. 

19. This exercise requires some preliminary motivation. Consider a Hamiltonian system 
with Hamiltonian H — {QjP) ^ . Suppose we make a canonical coordinate 

transformation: 

Q = Q(q,p), P = P{q,p), 



with inverse 



q = q{Q,P), p = p(Q,P), 



such that in the (Q, P) coordinates the new Hamiltonian depends only on the Q vari- 
ables, i.e., 

H{q,p) = H(Q). 

Since the transformation is canonical Hamilton’s equations become: 

dH 

Q= ^(Q) = o. 

These equations can be easily integrated, and the expression for the trajectories is: 

Q(i) = Q(0) = Qo, 

P{t) = p(o)-t|^(Qo). 

oQ 



Now all we need is the appropriate canonical transformation that “simplifies” the 
Hamiltonian in the manner described above. Accordingly, we seek a generating func- 
tion, S{q,Q), satisfying: 

H = ttiQ). (13.9.1) 



Now viewing Q as constants, this is exactly the form of the Hamilton- Jacobi equation 
(for a time-independent Hamiltonian). Hence, finding a solution of the Hamilton-Jacobi 
equation of the form of (13.9.1) leads to the integration of Hamilton’s equations. 

This result is due to Jacobi, and is summarized in the following theorem. 

Theorem 13.9.1 (Jacobi’s Theorem) If a solution S{q, Q) is found to the 
Hamilton-Jacobi equation (13.9.1) depending on the N parameters Q such that 



I dS \ , 

det ( 1 ^ 0, 

I dqdQ i ^ 



then Hamilton’s equations 



q = -^{q,p), 



p = --^{q,p)< 
dq 



(13.9.2) 
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can be be solved explicitly. The functions Q{q,p) determined by the equations 



p = 



dS 



(q.Q). 



are first integrals of (13.9.2). 



Provide all the details to the proof of Jacobi’s theorem. 

20. Consider the Hamilton- Jacobi equation for a time independent function H\ 



H 







q e ] 



(13.9.3) 



Suppose that it has a solution of the form 

S — 5i (gi , ai , . . . , on) + • • • + -SivC^iV, ai, . . . , o-iv) — Et, (13.9.4) 



where (ai, . . . , a^) are constants. A Hamilton-Jacobi equation possessing a solution 
of this form is said to be separable. Let Ci denote a simple closed curve in the qi — pi 
plane. Then the integral 

Ji = ^ Pidqi, (13.9.5) 



is referred to as the phase integral or action variable. 

(a) Show that the Ji are functions of ai, . . . , Uiv only and, hence, we can write 
S — S\{qi, Ji, . . . , Jn) + ••• + SN{qN , Ji, . . . , Jn) — Et. 

(b) Define 

dS 

wi ^ ( 13 . 9 . 6 ) 

and consider the canonical transformation from the q — p coordinates to the w—J 
coordinates defined by this generating function. Let H denote the Hamiltonian 
in these coordinates. Show that Hamilton’s equations can be integrated in these 
coordinates and that the trajectories are given by 



Ji(t) = Ji(0) = constant, 



Wi{t) — Wi{0) + 



dH 



(J(0))i, 



(13.9.7) 



where J(0) = (Ji(0), . . . , Jiv(O)). 
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Hamiltonian Vector Fields 



In this section we will develop some of the basic properties of canonical 
Hamiltonian vector fields. Background on Hamilton’s equations can be 
found in any book on classical mechanics (see, e.g., Whittaker [1904] or 
Goldstein [1980]). Over the past 15 years there has been a great deal of 
research on Hamilton’s equations. Most of the research has occurred along 
two directions. One direction is concerned with the geometrical structure of 
Hamilton’s equations. The other direction is concerned with the dynami- 
cal properties of the flow generated by Hamiltonian vector fields. Excellent 
references for both viewpoints are Abraham and Marsden [1978], Arnold 
[1978], Guillemin and Sternberg [1984], and Meyer and Hall [1992]. 

For a C’’, r > 2, real valued function on some open set U C Hamil- 
ton’s canonical equations are given by 

f)J-T 

P=-^(q,p), {q,p)€UcR^-. (14.0.1) 

It will often be convenient to write these in a more compact notation. 
Defining x = {q,p), then (14.0.1) can be written as 

x = JDH{x), (14.0.2) 



where 



J = 



0 id 
-id 0 



“id” denotes the n x n identity matrix, and DH{x) = (^, ^)- 



In this section we will develop a number of ideas related to the 
trajectories generated by Hamiltonian vector fields. In this con- 
text, we will assume that the trajectories exist for the length of 
time required in order for the ideas under discussion to “make 
sense”. Of course, in specific applications this should be verified. 



Before proceeding further it will be useful to first establish notation that 
we will frequently use throughout our discussion. 
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Notation 



Coordinates. Points in the phase space will be denoted by a; = 
(q,p) = {qi,...,qn,Pi,---,Pn)- 

Derivatives of Real Valued Functions. For a real valued function H 
on we will use the following notation for the derivative with 
respect to the q coordinate: 

dH _ (dH dH\ 
dq ~ \dqi" ' ' dqn) ' 

Moreover, we will also use the following shorthand notation: 



dHdG dH dG 
dq dq ~ ^ do.- do. ’ 






dqi dq^ 



with a similar notation for etc. 

Maps of the Phase Space into Itself. We will denote C'’, r > 1 maps 
of R2" into ]R2n by 

/ = (Q,P) : R2n 

X = {q,p) y = f{x) = {Q{q,p),P{q,p)), 



where 

{Q{q,p),P{q,p)) = {Qi{q,p), ■■■, Qn{q,p),Pi{q,p), ■ ■ .,Pn{q,p)) ■ 

By the symbol ^ we mean the following n x n matrix: 

/ dQi dQi \ 

I dqi dqn \ 



dq 



\ . . . dQn I 

\ dqi dqn / 



^ ^ will denote the n x n matrix whose i — j entry is given by 



dQdQ 
dq dp 






-E 

fc=i 



dQi dQk 

dqk dpi ' 



General Notation for Hamiltonian Vector Fields. For the sake of a 
more compact notation we will occasionally denote the Hamiltonian 
vector field derived from a function H by 



Xh{x) 



f dH dH\ 
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By a symplectic form on we mean a skew-symmetric, nondegenerate 
bilinear form. By nondegenerate we mean that the matrix representation 
of the bilinear form is nonsingular. A vector space equipped with a sym- 
plectic form is called a symplectic vector space. For our phase space a 
symplectic form is given by 

= {u, Jv), M, (14.1.1) 

where (•,•) denotes the standard Euclidean inner product on This 
particular symplectic form is referred to as the canonical symplectic form 
for reasons that we will now explain. 

Throughout this section our symplectic vector space will be 
equipped with the canonical symplectic form. However, virtually 
every result that we derive is valid for Hamilton’s equations aris- 
ing from general symplectic forms on finite dimensional vector 
spaces. The reader should consult Abraham and Marsden [1978] 
for the details of this more general theory. 

Note that nondegeneracy of the canonical symplectic form follows from the 
nondegeneracy of the Euclidean inner product and the invertibility of J. 

14.1a The Relationship Between Hamilton’s 
Equations and the Symplectic Form 

We next look at Hamilton’s equations from a different point of view, and 
one that is more in line with much current mathematical research in Hamil- 
tonian mechanics. 

We say that the symplectic form 0(-, •) defines a symplectic structure on 
the phase space K^". For a given Hamiltonian function H, the correspond- 
ing Hamilton’s equations are then derived from the symplectic structure 
through the following formula 

VL{Xh{x),v) = {DH{x),v), X G 1/ C u G ]R2«_ (14.1.2) 

One can think of (14.1.2) as an equation for Xh{x), for a given H{x). 
Often, (14.1.2) is written in the following shorthand notation 

ixH^ = DH, (14.1.3) 

where ixn^ = and the pointwise nature of the formula is not 

explicitly denoted. The symbol ixg^ is referred to as the interior product 
of the vector field Xh with the bilinear form Q. This operation creates a 
linear form which makes the left-hand-side of (14.1.2) compatible with the 
linear form {DH, ■) on the right-hand-side of the formula. 
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Now we return to the question of deriving Hamilton’s equations from 
this formula. Let X = (q,p) denote an arbitrary vector field on U C 
with DH = Then (14.1.2) becomes 

,v). (14.1.4) 

It is a simple calculation to verify that = —J and, thus, that 



{{q,p),Jv) = {-J{q,p),v) = {{-p,q),v). (14.1.5) 

Substituting (14.1.5) into (14.1.4) gives 

Now we are practically done, we need only appeal to nondegeneracy of the 
symplectic form. For fixed v, using linearity, we can rewrite (14.1.6) as 



which holds for all v G K^". Hence by nondegeneracy of the symplectic 
form we must have 



i-p,q) 



f dH dH \ 



= 0 , 



or, 



q = 



dH 

dH 



P = 



dq ’ 

which are Hamilton’s canonical equations. 



14.2 Poisson Brackets 



Let H, G : C/ — >■ M denote two C"", r > 2, functions. Then the Poisson 
bracket of these two functions is another function, and it is defined through 
the symplectic form as follows: 

{H,G} = n{XH.XG) = {Xh,JXg). (14.2.1) 



It follows immediately that the Poisson bracket is antisymmetric. Using the 
definitions of Xh, Xg, as well as the canonical symplectic form, we easily 
see that (14.2.1) assumes the following “coordinate” form: 



{H,G}^Y1 

i=l 



dH dG 
dqi dp. 



dH dG 

dpi dqi ■ 



(14.2.2) 
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14.2a Hamilton’s Equations in Poisson Bracket 
Form 



For a scalar valued C’’, r > 2, function F : U — >■ M, [/ C the rate of 
change of this function along the trajectories generated by the Hamiltonian 
vector field = (||, -||) is given by 



• ^dF . dF . 



-E 



dF dH 
dqi dpi 



= {F,H}. 



dF dH 
dpi dqr ’ 



Hamilton’s equations are alternately written as follows: 

F = {F,H}, VF:C/^M. (14.2.3) 

To see that (14.2.3) implies (14.0.1) we can substitute the “coordinate 
functions” {q,p) i-A gi and (g,p) ^ Pi into (14.2.3), which will yield (14.0.1). 

It follows easily from (14.2.2) that for any scalar valued C”, r > 1, 
function F : U ^ M., U C we have 



{F,F} = 0. 

Using this fact, as well (14.2.3), we obtain the following result. 

Proposition 14.2.1 The Hamiltonian H{q,p) is constant along trajecto- 
ries of the Hamiltonian vector field Xh- 



More generally, we refer to any function F satisfying 



{F,H} = 0, 



as an integral or constant of the motion with respect to the dynamics gener- 
ated by the vector field Xh- Integrals have a nice geometric interpretation 
which can be deduced from the following simple calculation: 



{F,H} = {Xp,JXh), 

= -{JXf,Xh) = 0. 



Now 



OF 



JXp = J 



dF 
' dq 




and the vector ~ is just a vector perpendicular to the level set 

of F, at each point at which it is evaluated. Hence, an integral has the 
property that the vector field is tangent to the surface given by the level 
set of the integral. 
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14.3 Symplectic or Canonical Transformations 

Symplectic, or canonical, transformations and their properties play a very 
important role in Hamiltonian mechanics. We begin with a definition. 

Definition 14.3.1 (Symplectic or Canonical Transformations) 

Consider a C"", r > 1, diffeomorphism f : — >■ K^”. Then f is said to 

be a canonical or symplectic transformation if 

= fl{Df{x)u, Df{x)v), Vx, u, f G (14.3.1) 

We remark that Definition 14.3.1 could have been much more general. In 
particular, the transformation could be between spaces of different dimen- 
sions (and therefore also not invertible). However, for our purposes this 
definition will be sufficient. 

For the canonical symplectic form (14.3.1) takes the form 

{u, Jv) = {Df{x)u, JDf{x)v) 

= {u,{Df{x)f JDf(x)v), (14.3.2) 

where (Df{x))^ denotes the transpose of the matrix Df{x). Since (14.3.2) 
must hold for all u,v we have 

{Df{x)f JDf{x) = J, (14.3.3) 

which provides a computable means for determining whether or not a trans- 
formation is symplectic with respect to the canonical symplectic form. If we 
take the determinant of (14.3.3), and use the easily verifiable facts that 
det J = 1 and det (Df{x))^ = detDf{x), we obtain 

{det Df{x)f = 1. 



Hence, 

det Df{x) = ±1, 

and we see that symplectic transformations are volume preserving. Actu- 
ally, it can be shown that det Df{x) = 1, so that symplectic transformations 
are also orientation preserving (see the exercises). 

For later purposes it will be useful to write out (14.3.3) using the q — p 
local coordinates that we defined earlier. Writing the symplectic transfor- 
mation / as 

/ : (q,p) {Q{q,p),P{q,p)) , 

the Jacobian of / is given by 
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and (14.3.3) takes the form 

A^JA=J. (14.3.4) 

We refer to a matrix satisfying (14.3.4) as a symplectic matrix. 



14.3a Eigenvalues of Symplectic Matrices 

Proposition 14.3.2 Suppose that A is a symplectic matrix and that A G C 
is an eigenvalue of A. Then A, and i are also eigenvalues of A. If X is 
an eigenvalue of multiplicity k, then ^ is also an eigenvalue of multiplicity 
k. Moreover, the multiplicities of the eigenvalues +1 and —1, if they occur, 
are even. 

Proof: We begin with the following algebraic manipulations of the charac- 
teristic polynomial of A using simple properties of determinants (where 1 
denotes the 2n x 2n identity matrix): 

p(A) = det {A — Al) 

= det (j(A-Al)J-i), 

= det — Al^ , using (14.3.4), 

= det {A~^ — All) , since det A~^ = detA~^^, 

= det (A-i(Il-AA)), 

= detA~^ det (II — AA) , 

= (_A(/l-ill)), = ^ = 

= \^^det > 

= A> Q) . (14.3.5) 

Since det A = 1 it follows that 0 is not an eigenvalue of A. Therefore from 
(14.3.5) it follows that if A is an eigenvalue so is j. Moreover, since the 
coefficients of the characteristic polynomial are real {A is real), then if A is 
an eigenvalue so is its complex conjugate, A. 

Next we consider the issue of the multiplicity of the eigenvalues. Suppose 
Ao is an eigenvalue of multiplicity k. Then the characteristic polynomial can 
be factored as follows 

p(A) = (A-Ao)'=Q(A), (14.3.6) 

where Q{X) is a polynomial in A of degree 2n — k. Using (14.3.5) and some 
trivial algebra, (14.3.6) can be written as 

t(^)a2” = (A-Ao)"Q(A) = (AAo)"(^-^) g(A), (14.3.7) 
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or 




ly Q(A) 

A ) ■ 



(14.3.8) 



Now is a polynomial in A of degree 2n—k. So it follows from (14.3.8) 
that ^ is an eigenvalue of multiplicity £ > k. 

Next, we reverse the roles of Aq and use the fact that ^ is an eigen- 
value of multiplicity £, and go through the same argument. After some 
trivial algebra we obtain 



P 




(. 1 V 0(A) 

'v Wj A2"-^- 



(14.3.9) 



Now is a polynomial in j of degree 2n — £. Substituting ^ for A 

in (14.3.9), and using the fact that Aq is a zero of the right hand side of 
(14.3.9) of multiplicity k, we see that ^ has multiplicity £ < k. Therefore 
£ = k and we are done. 

Now 1 or —1 is an eigenvalue if and only if Aq = ^. It follows from 
the above result that the multiplicity of the eigenvalues 1 and —1 is even. 
However, det A = 1, therefore the multiplicity of each must be even. □ 



14.3b Infinitesimally Symplectic Transformations 

Definition 14.3.3 (Infinitesimally Symplectic Transformations) 

Consider a C'' , r > 1, map f : — >■ K^”. Then f is said to be an 

infinitesimally symplectic or O skew transformation if 

£l{Df(x)u,v) = —£l{u,Df(x)v), Vx, m, w G (14.3.10) 

In terms of the canonical symplectic form, (14.3.10) becomes 

{Df{x)u, Jv) = —{u, JDf(x)v). (14.3.11) 

Using the same manipulations as above, (14.3.11) can be transformed into 
the following form 

(j'^Df{x)u,v) = -{{JDf(x))'^ u,v), 

which holds for all u, v G Hence, using = — J, we have 

JDf{x) + Df{xfj = 0. (14.3.12) 

We refer to a matrix satisfying (14.3.12) as an infinitesimally symplectic 
matrix. It can be shown that the exponential of an infinitesimally symplectic 
matrix is a symplectic matrix, see Guillemin and Sternberg [1984]. There 
is a connection here with Lie algebras and groups which the reader can 
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find fully developed in Abraham and Marsden [1978] or Guillemin and 
Sternberg [1984]. 

The importance of the notion of “infinitesimally symplectic” follows from 
the following proposition. 

Proposition 14.3.4 Let X : U ^ denote a C”, r > 1 vector field on 
some open, convex set U C Then X = Xh, for some Hamiltonian 

H : U ^ if and only if DX{x) is an infinitesimally symplectic matrix 
for all X € U. 

Proof: Differentiating (14.1.2) with respect to x in the direction u, and 
using bilinearity of 12 gives 

12 {DXh{x)u, v) = B^H(x)(v, u). (14.3.13) 

From this, and the symmetry of the second partial derivative matrix, we 
obtain 



12 {DXh{x)u, v) = H{x){y, u), 

= 12 {DXh{x)v, u) , 

= —12 (u, DXh{x)v) (using skew symmetry). 

(14.3.14) 

From this calculation it follows that if Xh{x) is Hamiltonian, then DXh{x) 
is an infinitesimally symplectic matrix. 

Now suppose that DX(x) is an infinitesimally symplectic matrix. Using 
convexity of U, we define 

H{x) = f 12(A(te), x)d2 + constant, (14.3.15) 

Jo 

and claim that X = X^. This is deduced from the following calculation 

(DH{x),v) = f {Lt{DX{tx)tv,x) + Lt{X{tx),v)) dt 
Jo 

1 

(— 12(u, tDX{tx)x) + Lt{X (tx) , v)) dt, 

1 

{Lt{tDX{tx)x, v) + Lt{X{tx),v)) dt, 

= ^( [ ^ (tX(tx)) dt,v 
\Jo dt 

= n{X{x),v). (14.3.16) 




Hence, by the definition in (14.1.2), X is Hamiltonian with respect to the 
Hamiltonian (14.3.15). □ 
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14.3c The Eigenvalues of Infinitesimally 
Symplectic Matrices 

It follows from the previous proposition that the matrices associated with 
the linearization about equilibria of Hamiltonian vector fields are infinites- 
imally symplectic. The following result is therefore useful for the study of 
stability of equilibria of Hamiltonian vector fields. 

Proposition 14.3.5 Suppose A is an infinitesimally symplectic matrix. 
Then if X € C is an eigenvalue of A so are —X, X, and —X. If X is an 
eigenvalue of multiplicity k, then —X is also an eigenvalue of multiplicity 
k. Moreover, ifO is an eigenvalue then it has even multiplicity. 

Proof: As in the proof of proposition 14.3.2, we begin with the following 
algebraic manipulations of the characteristic polynomial of A using simple 
properties of determinants (where 1 denotes the 2n x 2n identity matrix) : 

p{X) = det {A — AH) , 

= det (J(H-AII) 

= det {—A^ — Al) , using (14.3.12), 

= det {-A - AIl)^ , 

= det (- {A + All)) = (-1)2 "p(-A) = p(-A). 

From this equality, and the fact that A is real, the first part of the propo- 
sition follows. The rest of the proposition follows from arguments identical 
to those of proposition 14.3.2. □ 

14.3d The Flow Generated by Hamiltonian Vector 
Fields is a One-Parameter Family of 
Symplectic Transformations 

Theorem 14.3.6 Let 4>t{‘) denote the flow generated by the Hamiltonian 
vector field defined on some open, convex set U € Then for each 
t 4>t is a symplectic transformation. Conversely, if the flow generated by 
a vector field consists of symplectic transformations for each t, then the 
vector field is a Hamiltonian vector field. 

Proof: We begin with two preliminary calculations. First we derive the 
following first variational equation that the linearized flow of an arbitrary 
vector field X{x) must satisfy 

^ {D(j)t{x)v) = D , 

= D {X{(j)t{x))v ) , 

= DX{4>t{x)) {D4>t{x)v ) , 



(14.3.17) 
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which holds for any v G 

Secondly, we use (14.3.17) to derive the following identity 

ft{D(j)t{x)u,JD(j)t{x)v) 

= {DX {4>t{x)) {D4>t{x)u) ,JD4>t{x)v) 

+{D4>t{x)u, JDX (</>t(x)) {D4>t{x)v)), 

= -{JDX {4>t{x)) {D4>t{x)u) , D(j)t{x)v) 

-{DX {4>t{x))^ JD4>t{x)u, D4>t{x)v), 

= -{(^JDX {4>t{x)) + DX {4>t{x))'^ {D4>t{x)u) ,D(j)t{x)v). 

(14.3.18) 

Now, let’s assume that the flow is generated by a Hamiltonian vector field, 
Xh- Then, from Proposition 14.3.4, DXh is infinitesimally symplectic, 
therefore 

JDXh + DXjjJ = 0, 

Hence, (14.3.18) reduces to 

’^{D4>t{x)u,JD4)t{x)v) = 0. (14.3.19) 

at 

Integrating (14.3.19) with respect to t, between 0 and t, and using 
D(pt{x)\t=Q = id, we obtain 

{D4>t{x)u, JD(j)t{x)v) = {u,Jv). (14.3.20) 

Therefore 4>t is a symplectic transformation for each t (recall (14.3.2)). 

Conversely, suppose (j)t is a symplectic transformation for each t. Then, 
tracing our steps backwards, (14.3.20) holds, and hence (14.3.18) is zero, 
i.e., 

-{(^JDXh {4>t{x)) + DXh {4>t{x))'^ {D4>t{x)u ) , D4>t{x)v) = 0. 

(14.3.21) 

Using nondegeneracy of the Euclidean inner product and the invertibility 
of D<j>t, the fact that (14.3.21) folds for all u, v G implies that 

JDXh + DX]jJ = 0. 



Hence, DXh is infinitesimally symplectic, and it therefore follows from 
Proposition 14.3.4 that Xh is Hamiltonian. □ 
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14.4 Transformation of Hamilton’s Equations 
Under Symplectic Transformations 

We now discuss the transformation of Hamilton’s equations under sym- 
plectic coordinate changes. However, first we discuss coordinate transfor- 
mations for nonlinear ordinary differential equations in general. Consider 
the following vector field 

x = G{x), xGM". (14.4.1) 

Suppose we want to transform from the “x” coordinates to the “y” coor- 
dinates, where the x and y coordinates are related through the following 
diffeomorphism (which in general we would like to be as differentiable as 
possible) 

y = f{x). (14.4.2) 

Differentiating (14.4.2) with respect to t gives 

y = Df{f~^{y))x. (14.4.3) 

Substituting (14.4.1) into (14.4.3) gives 

y = Df{r\y))G{f-\y)), (14.4.4) 

which is the vector field in the y coordinates. The right-hand-side of (14.4.4) 
gives the rule for how vector fields transform under a coordinate change 
given by (14.4.2). We will use this general result on the way to determining 
how Hamilton’s equations transform under symplectic coordinate changes. 
However, initially we will determine how the Poisson bracket transforms 
under symplectic coordinate changes. This will bring us close to our goal 
since Hamiltonian dynamics can be defined through the Poisson bracket. 
We use the general notation introduced earlier in this section. Suppose 

/= (Q,P) 

X = (q,p) y = f{x) = {Q{q,p),P{q,p)) 



is a symplectic transformation. The Poisson bracket with respect to the 
x = (q,p) coordinates is given by 

{H, G}^ j, = {Xh{x), JXg{x)). (14.4.5) 

Next we want to determine the Poisson bracket with respect to the y = 
{Q, P) coordinates. This can be obtained from (14.4.5) and the rule for the 
general transformation of vector fields given in (14.4.4). Using (14.4.4) to 
describe the transformation of Xh and Xq, we have 

{H,G}qp = {Df{r^{y))XH{rHy)),JDf{f-\y))XG{r\y))). 

(14.4.6) 
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Since the transformation is symplectic, the right-hand-sides of (14.4.5) and 
(14.4.6) are equal and we have 

= (14.4.7) 

Referring back to the Poisson bracket formulation of Hamilton’s equa- 
tions, (14.4.7) shows that the form of Hamilton’s equations is unchanged 
under symplectic transformations. More practically speaking, suppose we 
have a Hamiltonian H = H{Q, P) and a symplectic change of coordinates 
{Q{q,p), P{q,p)). Then the Hamiltonian vector field in the (Q,P) coordi- 
nates is also a Hamiltonian vector field in the (q,p) coordinates with the 
Hamiltonian given by = H{Q{q, p) , P{q, p)) . Or, saying this still another 

way, under symplectic coordinate transformations Hamiltonian vector fields 
transform to Hamiltonian vector fields, and the transformed Hamiltonian 
vector field can be computed by taking the appropriate derivatives with 
respect to the “new” coordinates of the “old” Hamiltonian , which is ex- 
pressed as a function of the “new” coordinates by expressing the ’’old” 
coordinates as functions of the “new” coordinates. 

The reader should compare the discussion in this section with that in 
Section 13.7c. 



14.4a Hamilton’s Equations in Complex 
Coordinates 

For certain calculations we will see that it is easier to use Hamilton’s equa- 
tions defined on C" rather than We will consider the Hamiltonian as 
a real valued function of the complex variables z a, nd z where 



Zj = qj + ipj, Zj = qj -ipj, j = 1, . . . , n, (14.4.8) 



where partial derivatives are related through the following expressions 

) ’ 9zj 2 

The Poisson Bracket of two real valued functions of 2 and z takes the 
following form in complex coordinates 



d 



d 



+ , . (14.4.9) 

dqj dpj 



d 1/9 .9 

dzj 2 \dqj ^ dpj 



{G,i7} = 2z^ 
i=i 



9G dH 

dzj dzj 



9G dH\ 

dzj dzj J 



This can be verified from (14.2.2) by using (14.4.8) and (14.4.9). Hamilton’s 
equations in complex coordinates then take the form 



OH 

= -2t^, 



j = l,...,n. 



(14.4.10) 
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14.5 Completely Integrable Hamiltonian Systems 

An n degree of freedom Hamiltonian system 
f)J-T 

P=-^(q,p), {q,p)€UcR^^, (14.5.1) 

is said to be completely integrable if there exists n functions (called integrals) 

F\ = H,F2, - ■ ■ , Fn, 
which satisfy the following conditions. 

1. The Fi, i = l,---,n, are functionally independent on U, with the 
possible exception of sets of measure zero. 

2. {Fi, Fj} = 0, for all i and j. 

Completely integrable Hamiltonian systems can be “solved” in some 
sense. In particular, they can be reduced to quadratures (this is a result 
due to Liouville and Jacobi, see Arnold [1978]). If the sets defined by 

Fi = fi, 

F2 = / 2 , 



Fn = fn, 

where fi, i = 1, - ■ ■ ,n are constants, are compact and connected manifolds, 
then it can be proved that they are actually n-tori on which action-angle 
variables can be introduced. This was proved by Arnold (see Arnold [1978]), 
and we state his theorem below. First, we define the notation 

Mf = {{q,p) € M"” I F,{q,p) = /„ i = 1, . . . n} . (14.5.2) 

Theorem 14.5.1 (Liouville- Arnold) 

1. Mf is a manifold, as differentiable as the least differentiable integral, 
and is invariant under the dynamics generated by (If. 5.1). 

2. If Mf is compact and connected then it is diffeomorphic to the n- 
dimensional torus 



T” = {(^ 1 , . ..,(j)n) mod27r} . 
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3. The flow generated by (14-5.1) gives rise to quasiperiodic motion on 
T" , i.e. in angular coordinates on Mf we have 

U}{f) = UJnif))- 

4- Hamilton’s equations can he integrated by quadratures. More precisely, 
in a neighborhood of Mf we can construct a symplectic coordinate 
transformation 

where I € B C K", B is an open set, and 6 G T". In these coordinates 
the Hamiltonian takes the form 

H{q{I,e),p{I,9)) = K{I), 



with Hamilton’s equations given by 



i 

e 



f (/)--(/). 



These equations can he trivially integrated 

I = constant, 
0{t) = co{I)t 6 q. 



(14.5.3) 



(14.5.4) 



14.6 Dynamics of Completely Integrable 
Hamiltonian Systems in Action- Angle 
Coordinates 

In this section we want to consider a number of geometrical and analytical 
issues associated with completely integrable Hamiltonian systems expressed 
in action-angle variables, i.e., Hamiltonians of the form 

H = H\I), 

which give rise to the Hamiltonian vector field 
F) 

i=-^{I)=0, (14.6.1) 



dl 






(/,6»)gHxT”, (14.6.2) 
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where B is the ball of radius R in M" and T" is the n-torus. 

It follows immediately from the form of these equations that 

{/ = /g = constant} x T”, 

is an invariant manifold for this system. In particular, as a result of the 
nature of the coordinates, it is an invariant n-torus. Thus the phase space 
is foliated by an n-parameter family of n-tori. Moreover, the trajectories 
on these tori are given by 



I{t) = Iq = constant, (14.6.3) 

0{t) = uj{Io)t + 00- (14.6.4) 

In the remainder of this section we want to examine in more detail the 
nature of this foliation of phase space by n-tori. 

Before proceeding we need to take care of a technical detail. Many of the 
results will require that the Hamiltonian be non- degenerate in a way that 
we now describe. 

Definition 14.6.1 (Nondegenerate) The Hamiltonian is said to he non- 
degenerate if the frequency map 

1 1 — >■ 

is a diffeomorphism. A sufficient condition for this is 

\ 

det (^^(/)j^0. 

14.6a Resonance and Nonresonance 

The n-parameter family of tori in the foliation of the phase space are either 
resonant or nonresonanf and we now define these notions. 

Definition 14.6.2 (Resonance) The frequency vector uj is said to be res- 
onant if there exists k € 2" — {0} such that k-ui = 0. If no such k G Z" — {0} 
exists, to is said to be nonresonant. 

The n-dimensional nonresonant tori have the property that trajectories 
on the tori densely fill out the torus. More precisely, this means that given 
any point on a nonresonant torus, and any neighborhood of that point, 
the trajectory through that point will re-intersect that neighborhood after 
leaving it. Moreover, given any other point and any other neighborhood of 
that point, the same trajectory will also intersect that neighborhood. This 
is a classical result that goes back to Kronecker (the flow on a nonresonant 
torus is often referred to as Kronecker flow). Chapter 23 of Hardy and 
Wright [1938] gives an excellent and detailed exposition of the proof from 
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the number theory viewpoint. A proof of this fact can also be found in 
Arnold [1978]. 

There are different types of resonant tori. One way of describing this is 
through the notion of the multiplicity of a resonance. 

Definition 14.6.3 (Multiplicity of a Resonance) A resonant fre- 
quency vector is said to be of multiplicity m < n if there exist independent 
fc* G 2” — {0}, z = 1, . . . , TO, such that lA ■ to = 0. 

We will discuss the some of the geometrical meanings of the multiplicity 
of resonant tori shortly. However, in a rough sense, resonances of high 
multiplicity are more difficult to analyze and lead to more complicated 
dynamics when perturbed from the integrable setting. The notion of the 
order of a resonance will also play an important role in our studies. 

Definition 14.6.4 (Order of a Resonance) Suppose k-uj = 0 for some 
k € 2” — {0}. Then the order of this resonance is defined to be \k\ = 

Roughly speaking, resonances of high order are less complicated than those 
of low order. This will be quantified later on. 

The following proposition describes some aspects of the dynamics on 
resonant tori of multiplicity to. 

Proposition 14.6.5 (Foliation of Resonant Tori) Suppose the n- torus 
I = I* is resonant of multiplicity m < n, i.e., oj{I*) is a multiplicity m 
frequency vector. Then the dynamics on the n-torus 1 = 1* is such that it 
is foliated by invariant tori of dimension n — m with trajectories densely 
filling out these lower dimensional tori. 



Proof: On a resonant invariant n-torus / = /* of multiplicity to the dy- 
namics is given by 

dH° 

9 ="^{1*)= coin, 9€T^. 
and the resonance relations are 

h ■ coin = hiLOiil*) + • • • + kinCOnil*) =0, fc, G 2” - {0}, Z = 1, . . . , TO, 

with the ki’s being linearly independent over the integers. Therefore, the 
m X n matrix 

( kii . . . kiji 

K= : : 

\ ^ml • • • kmn 

has maximal rank to. Without loss of generality we may assume that the 
first TO columns of the matrix K are linearly independent, and write 

( fell . . . kim 
Ki= : : 

\ kmi . . . k mm t 
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and 



Ko = 



^ • • • ^In 

■ • • ^mn 



Then Ki is non-singular. Introduce the linear transformation of coordinates 
0 ^ (j) on T" by 



(j)l — kixO\ 4“ ... 4“ l — 1, . . . , 77T-, 

4>i = 6i, z = TO 4- 1, . . . , n. 



This transformation is a diffeomorphism because it is linear and nonsingu- 
lar, as the following calculation demonstrates: 



det 




K2 

(n—m) X (n— m) 



det(itTi) ^ 0. 



where id(„_m)x(n-m) denotes the {n — m) x {n — m) identity matrix. In 
these new coordinates, the dynamics on the resonant n-torus of multiplicity 
TO are given 

^i = {Kuj{r))^ = Q, z=l,...,TO, 

(pi=uji{r), i = m+l,...,n. 

These equations are easily solved to yield the trajectories 



4>i = 4‘i = constant, i = 1, . . . , to, 

(j>i = = u)^{r)t + i = m+l,...,n. 

Hence, for a fixed 'tp* = {cjil, . . . , <p’^), ...,(pl)G T""™ parametrize 

a torus of dimension n — m. Thus, the n-torus T" specified by / = /* is 
foliated into an nz-parameter family of (n — to) tori. 

To show that the trajectories are dense in the (n — m)-tori, we only need 
to verify that the frequencies LOm+i{I*), ■ ■ ■ are not in resonance. 

Suppose it is not so, then there exist . ,k'^ not all being zero such 

that 

k'^j^-^LOm+li.1*) 4- ... 4- k'^UJni.1*) = 0. 

Without loss of generality we may assume that yf 0. Let k' be the 

n vector (0, . . . , 0, , • • ■ , Then k' and ki,i = 1, . . . , to are linearly 

independent because the matrix 




has a nonsingular (to 4- 1) x (to 4- 1) submatrix 



/ kii . 


• ■ ^Im 




krai • 


h 

• • ^mm 




\ 0 . 


.. 0 


^m+1 / 
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This contradicts the assumption that w(/*) is resonant of multiplicity m. 

Since u>rn+i{I*) ■ ■ ■ are nonresonant and constants, the corre- 
sponding trajectories of (</'„+!) ■ • ■ ) 4‘V) T"-™ are quasiperiodic wind- 

ings which fill the torus densely. The proof can be found in many references, 
see the comment following Definition 14.6.2. □ 

Tori of multiplicity n — 1 are often referred to as periodic tori since they 
are foliated by 1-tori, i.e., periodic orbits. 

The Notion of Measure. The term measure of a set will be used in 
various places throughout this book. By this we will mean nothing more 
than ordinary Lebesgue measure on R". Most of the sets of interest will 
be quite well-behaved so that “measure” and “area” or “volume” will 
be synonomous. For any set U G R", we will denote the measure of U 
by mes(?7). 



Proposition 14.6.6 Suppose that 



det 



dl,dlj 



^0 



in B. Then the nonresonant values of I are dense in B and occupy a set 
of full measure. Moreover, the I values corresponding to nonresonant tori 
of dimension n — k are also dense in B, hut occupy a set of zero measure, 
for k = 1, - ■ ■ ,n — 1. 

Proof: We introduce the following notation: 

Qb = {w G M" I w = oj{I), for some / G B}, 

rir = {w G M" — {0} I fc • w = 0 for some k G 2" — {0}}, 

fig = {to G Q" — {0}}. 



Namely, fls is the set of all frequencies under consideration, fir is the set 
of all resonant frequencies and flq is the set of all frequencies with rational 
components. 

First we argue that fir is dense in K”. This follows immediately from the 
following facts. 



1 . fl q t fl r . 

2. ftq is dense in K” — {0}. 

To show that the set Pr = {I G B \ to{I) G fir} is dense in B, we only 
need to note that 



dooi 

Wq 



= det 



/ a^i/o \ 
\dmj 



det 



^0. 
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Hence the map 

g I (z: B I — y 

is a diffeomorphism on B so that Pj. = C\ B is dense in H fl M” 

follows. 

To show that 

mes(Pr) = 0 

we define the map 

fk ■ I B 1-^ kiLOi{I) + . . . + knU>n{I) G R 

for each fixed k G 2” — {0}, and denote the set of I corresponding to this 
particular resonance relation as 

P^ = {/ G Pr I fk{I) = 0, for the given A: G 2" — {0}}. 

Then is the zero set of fk 

fk{I*) = kiUJiiP) + . . . + knUJuin =0, r G P^ 

and P* C Pr- We have 






since 




Hence, 0 is a regular value of fk so that the zero set of /, P* is a manifold 
of dimension n — 1. Then 

mes(Pj) = 0. 

Notice that 

Pr = U P- 

fcez^-{o} 

k-io{I)=^0 

and the set 2" — {0} is countable. Then 

mes(Pr) = 0 

follows. 

The rest of the proposition is straightforward. Note that the nonresonant 
set 

P = {/ G 5 I fc • w(/) = 0 fc = 0 G 2”} 
is complementary to the resonant set P^ in B, hence 

mes(P) = mes(H). 



so that P has the full measure so it is dense in B. 



□ 
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14.6b Diophantine Frequencies 

Certain nonresonant frequencies play an important role in the proof of the 
KAM theorem. These are the diophantine frequencies, which we now define. 
For T, 7 > 0 

Q(t,7) = {w G K" I Iw • fc| > 7|fc|-^ Vfc G 2” - {0}} (14.6.5) 

where 

|fc| = sup \ki\, 

i 

and we set 

= U fl(U7)- 

7>0 

It is important to note that the frequencies satisfying (14.6.5) satisfy a 
countable infinity of inequalities. These inequalities quantify the character- 
istic of these frequencies not being close to resonance. 

The following proposition is classical. All pieces of the proof can be found 
in Cassels [1957], Schmidt [1980], Arnold [1963], Russmann [1975], and 
Lochak and Meunier [1988]. For completeness, we provide a few additional 
details. 

Proposition 14.6.7 

1. For 0 < T < n — 1, Q(r) = 0. 

2. For T = n — 1, n(r) has Lebesgue measure zero. 

3. For T > n — 1, K" — n(r) has Lebesgue measure zero, and more 
precisely 

mes{(K”-Q(T,7))nRfl} < 
where Bn denotes the ball of radius R in M". 

Proof: 

Proof of 1. The proof of 1 relies on the following result that we state 
without proof. 

Theorem 14.6.8 (Dirichlet’s Theorem, Cassels) [1957], pg. 14 

For any 0 = (0i, • • • , 0m) G K"* the inequality 

\\p-0 ][ = ][ Pi6»i H VPmOm |[< br™ 

is satisfied by infinitely many p G 2™, where the notation j] • jj is defined 
by 

V6»gM, ]] 6» 11= inf 16»-g]. 



(14.6.6) 
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Without loss of generality we can assume yf 0. We set 0i = i = 

1, • • • , n — 1. Then, applying Dirichlet’s theorem, for any • • • , ^ G 

the inequality 

II + + ||< |ft|-("-i), (14.6.7) 

LOn 

is satisfied by infinitely many k = (fci, • • • , kn-i) € 2"“^. This implies that 

(14.6.7) has solutions for arbitrarily large |fc|. 

Now we are interested in solutions of the following inequality (viewing 
(wi, • • • ,u;„) as given) 

||fci— H + ^ ||> 7|fc|~'^, 0<T<n— 1. (14.6.8) 

C(J-n UJn 

Then, for a given (wi, • • • , o;„), simultaneous solution of (14.6.7) and 

(14.6.8) implies 



0 < 7|fc|-" <11 + • • • + kn-i"^ ||< |fc|(”-'\ 

UJr). ^n. 



or 



7 < 



At ^ + • • • + kn-\ 



<lfcl 



— n+l+r 



(14.6.9) 



Now for fixed 7 > 0 the right-hand-side of this bound can be made smaller 
than the left-hand-side for sufficiently large |fc|, provided 0 < r < n — 1. 

Hence, we conclude that for any 0 < r < n — 1, and for any , • • • , 
the inequality 

II + + I|>7|A|-", (14.6.10) 

UJri 

does not hold for all k G 
From (14.6.6) we have 



1 ; I I ; ^"-1 I ; I ^ • 1 ; “^1 I I ; ‘^n-l , , 

|fci 1- • • • -I- kn-i h kn\ > mm \ki 1 -|- fc„_i h k„ 

LOyi ^T), OJr) ^T) 



II 7 

= II ki — 

^n. 



kf^lL 



n— 1 



-I- ||> 7|fc| 

(14.6.11) 



Now recall |fc| = max |fcj|, and for fc yf 0 \k\ > 1. Then if we define 

= (fci,---,A:„_i), 



and 

fc" = (fci,---,M, 

where the first n — 1 elements of fc” are the same as we have 

\k-\>\k--^\, 
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and, thus, 

Using this fact, along with (14.6.11), we conclude that the inequality 

\kiLOi H h kn-lUJn-l + kntOn\ > w„7|/c|“^, k G 2”, 0 < T < n - 1, 

(14.6.12) 

is not solved for all k G 2". We have shown that U {T,uJn'f) = 0 for any 

7j 7^ 0- 

Proof of 2. This is the difficult case, and the reader is referred to Schmidt 
[1980]. 

Proof of 3. Here we follow Arnold [1963] and Lochak and Meunier [1988]. 

w • fc = 0 is a resonant plane passing through the origin in M" (and the 
integer vector k is perpendicular to the plane). A neighborhood of this 
plane is given by 

{wGM”||wfc| < 7 |fc|-^}, (14.6.13) 

and the width of this neighborhood is given by 

2j\kr~\ 

It is only in these neighborhoods of the resonant planes that the inequality 

\uj-k\> j\k\~'^, 

does not hold. The volume of one of these neighborhoods is given by 
mes{{w gM”| |w/c| < 'llkl-^} D Br} = 2C-fR’^-^\k\^~\ (14.6.14) 

where (7 is a constant depending on Br. Hence we have 

mes{(R”-U(r,7))nHfi} = 2C7i?”-i ^ \k\~^-^ 

fcGZ"-{0} 

= C(T)7i?”-i (14.6.15) 

which goes to zero as 7 — >■ 0 and where 

C(r)^2C' ^ \kr~\ 

feGZ"-{0} 



This sum converges only if r > n — 1, we leave the details of this last fact 
to the reader. □ 
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14.6c Geometry of the Resonances 

Here we describe a few features associated with the geometry of resonances. 
Essentially, these amount to properties of the equation(s) 

fc-w(/) = 0, fceZ”-{0}. 

We can view these properties as manifested either in action space or fre- 
quency space by use of the frequency map 

IgB. (14.6.16) 



Action Space 

In action space we make the following observations. 

• The solutions of the equation k ■ lo{I) = 0, k fixed, generically form 
a hypersurface in B. 

• The solutions of the r < n equations fc* • u{I) = 0,i = l,...,r, 
generically form a surface of codimension r in B. 

We also easily conclude that resonances are nested - a resonance of mul- 
tiplicity r is contained in a resonance of multiplicity r — 1. 

Frequency Space 

In frequency space the equation(s) k ■ lo = 0 represent a system of linear 
equations. It is natural to leave out the explicit dependence of a; on / in 
this representation. 

Energy Conservation 

In frequency space the resonance relation restricted to H^(I) = h = con- 
stant can be written as 

k\ h . . . -|- kn-i h — 0 

U)n 

where we suppose (without loss of generality) uj„{I) does not vanish in the 
region of interest. 



Resonance Structure Versus the Number of Angles 



n=l In this case resonance implies w = 0. 



n=2 In this case the resonance relation restricted to the energy surface is 



given by 



ki— k2 = 0. 
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Thus the resonances occur at isolated points in ^ space. Moreover, for 
nondegenerate systems they have different energies (i.e., different values 
for the Hamiltonian). We say that for two degree-of- freedom systems the 
resonances are energetically isolated. 



n=3 In this case the resonance relation restricted to the energy surface is 



given by 



ki — + k2— + ks = 0. 

W3 W3 



The resonance relation defines lines in ^ ^ space. For a fixed {ki, k 2 ,ks) 

the line corresponds to a multiplicity one resonance. Generically, if two 
lines intersect they do so in an isolated point. Such a point corresponds to 
a multiplicity two resonance. As (fci, k 2 , k^) runs through the resulting 
lines are dense in — — — space. This is a significant difference between 
two degree-of-freedom systems and systems with three or more degrees-of- 
freedom. In the former case the resonances are (generically) energetically 
isolated. In the latter case they are not. 



14.7 Perturbations of Completely Integrable 
Hamiltonian Systems in Action- Angle 
Coordinates 

This is a huge area, a good up-to-date and general reference is Arnold et 
al. [1988]. However, the two most important theorems are those due to 
Kolmogorov [1954]-Arnold [1963] -Moser [1962] (KAM) and Nekhoroshev. 
Here we state them. The Hamiltonian we consider is of the form 

H{I,6) ^ H°{I) + eH^{I,0), {I,9 )gBxT’^. (14.7.1) 

Below is the statement of the KAM theorem due to Poschel [1982]. 

Theorem 14.7.1 (KAM) Let the integrable Hamiltonian he real an- 
alytic and nondegenerate, and let the perturbed Hamiltonian H = H^ -\-eH"^ 
he of class C"’ with r > 2n. Then, for sufficiently small e proportional to 
7 ^, the perturbed system possesses smooth invariant n-tori with linear flow 
for all to G 0 (n, 7 ), i.e., restricted to the invariant n-tori, the vector field 
is analytically conjugate to 4> = uj. 

What about the lower dimensional tori described in Proposition 14.6.5? 
Recent results of Eliasson [1988], and Poschel [1989] prove the persistence 
of lower-dimensional “elliptic” tori, and results of de la Llave and Wayne 
[1990], and Treschev [1991] prove the persistence of lower-dimensional “hy- 
perbolic” (or “whiskered”) tori. 

The following theorem is originally due to Nekhoroshev [1977], but it 
has recently been sharpened considerably in the work of Lochak [1992], 
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Lochak and Neishtadt [1992], and Poschel [1993]. It is a result that requires 
analyticity of the Hamiltonian, as well as either convexity or quasiconvexity 
of the unperturbed Hamiltonian. 

The function H^{I) is said to be convex if 

II ||< M II u II, \7^H°{I)vv>m\\vf, V/ € B, u € R", 

and where 0 < m < M. Quasiconvexity is convexity restricted to the 
level set of the Hamiltonian. 

Theorem 14.7.2 (Nekhoroshev) For any initial condition (7(0), 0(0)) 
one has 

\I{t) - 7(0)1 < C'le'’ for |t| < C'2exp(C'3e““) , 

where a = b = l/2n, C\, C 2 , and C3 are constants, and provided e is small 
enough. 



14.8 Stability of Elliptic Equilibria 

Suppose (14.0.1) has an equilibrium point, which without loss of generality, 
we can assume to be at {q,p) = (0,0). We are interested in the stability 
of the equilibrium. Therefore, as a first step, we linearize (14.0.1) about 
the equilibrium and examine the eigenvalues of the matrix associated with 
the equilibrium. This matrix is an infinitesimally symplectic matrix and so 
Proposition 14.3.5 describes the nature of the eigenvalues. If all eigenvalues 
lie on the imaginary axis, but none are zero, then the equilibrium is said 
to be an elliptic equilibrium. Morse theory can be used to prove Liapunov 
stability of a large class of elliptic equilibria. We begin by developing the 
necessary background to prove this result. 

Definition 14.8.1 (Nondegenerate Critical Point) Suppose 
F : M" — >■ K is a C"’, r > 3, or analytic function. Suppose x = Xq 
is a point such that §y(a^o) = 0- Then Xq is said to be a critical point. 
If Xq is such that det^^(xo) yf 0 then it is said to be a nondegenerate 
critical point. 

Definition 14.8.2 (Morse Function) If x = xq is a nondegenerate crit- 
ical point of F{x) then F{x) is said to be a Morse function in a neighborhood 
of Xq. 

Suppose we Taylor expand F{x) about a: = 0. The n x n matrix associated 
with the second derivatives is symmetric. Therefore it can always be diag- 
onalized by a linear transformation. We assume that this has been done, 
after which the Taylor expansion assumes the form 

F{x) = F{Q) - cix\ Ckxl + Cfc+iXfc+i H h c„a;^ -b 0(3), 
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where Ci > 0, i = 1, . . . ,n. The integer k is called the index of the critical 
point. Then we have the following Morse lemma. 

Lemma 14.8.3 (Morse) If F is a C’’, r > 3, or analytic Morse function 
near x = 0, then in a neighborhood of x = 0 there exists a C^~^, or analytic 
diffeomorphism, which transforms F to the form 

G{x) = G(0) — vl — ■ ■ ■ — Vk + Vk+i ”1 + Un- 

Proof: See Golubitsky and Marsden [1983]. □ 

One now should be able to guess how these results are used to study sta- 
bility of elliptic equilibria in Hamiltonian systems. At an elliptic equilibrium 
the Hamiltonian is a Morse function. Now suppose the matrix associated 
with the second derivative of the Hamiltonian, evaluated at the equilibrium 
point, is positive definite. Then the equilibrium point has index zero and 
the Hamiltonian is locally conjugate to a Hamiltonian of the form 

H{q,p) - i?(0,0) = ql~\ ^ql+pj H \~ pI- 

Hence the energy surfaces are locally diffeomorphic to a family of spheres 
that shrink down to the point (0, 0) as i? — >■ H{0, 0). Since the trajectories 
are tangent to the energy surfaces Liapunov stability follows. In the case 
that the matrix associated with the second derivative of the Hamiltonian, 
evaluated at the equilibrium point, is negative definite we apply the same 
reasoning to the time reversed vector field (i.e., let H — >■ —H) and obtain 
the same result. 



14.9 Discrete-Time Hamiltonian Dynamical 
Systems: Iteration of Symplectic Maps 

The dynamics of symplectic maps can be viewed as a discrete time ana- 
log of the dynamics generated by Hamiltonian vector fields. Proposition 
14.3.2 gives us information about linearized stability of periodic orbits of 
symplectic maps and, hence, the local invariant manifold structure (i.e. 
existence and dimensions of stable, unstable, and center manifolds of pe- 
riodic orbits). There are also discrete analogs of the KAM theorem and 
Nekoroshev’s theorem, which we now describe. 

14.9a The KAM Theorem and Nekhoroshev’s 
Theorem for Symplectic Maps 

As in the continuous time case, the setting for these theorems is that of 
perturbations of integrable symplectic maps. Before stating the results we 
must develop the appropriate setting. 




224 



14. Hamiltonian Vector Fields 



Let Bs denote the open ball in M” of radius 5. Then the 2n-dimensional 
annulus is denoted by As = Bs x T". We define the following map on As'. 

fo • As — >■ As 



(/, 9) ^{1,9 + = foil, 9). (14.9.1) 

(14.9.2) 

It should be clear that As is foliated by an n-parameter family of invariant 
n-tori. We assume that ui is given by the gradient of a function, i.e., 

oj = Vh. 

In this case fo is a globally symplectic map with generating function h. 

Now we want to consider perturbations of fo, but not just any perturba- 
tions. Rather, we only wish to consider perturbations where the resulting 
perturbed map is also symplectic. This can be achieved through the use 
of generating functions, for which we refer the reader to Whittaker [1904], 
Goldstein [1980], Abraham and Marsden [1978], Arnold [1978], or Section 
13.7c for more background. For our limited purposes one could just take 
the following as the definition of the maps under consideration. 

We consider the map generated by a perturbation of h: 

ni',0) = hil') + <j{l',9), 

where a is small, with || a ]|= e. Then the perturbed map f : As ^ As is 
implicitly defined by 

/ : As — >■ As, 



{I',9')- - —,9 + u{I') + , (14.9.3) 

where primes denote the image of the respective coordinate under the map. 



Nekhoroshev Theorem for Symplectic Maps 

The discrete version of Nekhoroshev’s theorem was first announced in 
Nekhoroshev [1977]. Details first appeared in Kuksin and Pdschel [1994], 
but see also Bazzani et al. [1990] and Lochak [1992]. We first establish 
some notation. Let /®(/,0) = (G,6*s), with (Io,9o) = (1,9). We assume 
that {Is,9s) remains in As for all times under consideration. 



Theorem 14.9.1 Suppose h and a are analytic and h is a convex function. 



If(^ =11 o- ]]< eo 
where a = b = 



then \\ Is — I ||< ce° when |s| < cexp(e “), s G Z, and 



2n+2 ■ 



Proof: See Kuksin and Poschel [1994]. 



□ 
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KAM Theorem for Symplectic Maps 

The discrete version of the KAM theorem was first worked out in the 
analytic setting by Douady [1982] (although the two dimensional Moser 
twist theorem in the finitely differentiable case was obtained much earlier) . 
Kuksin and Pdschel [1994] provide a new proof. First we establish some 
notation. 

We assume that the unperturbed map is nondegenerate in the following 
sense: 

det y^h{I) = det — yf 0. 

This condition is often referred to as the twist condition. We are concerned 
with the preservation tori whose unperturbed frequency vectors satisfy the 
following diophantine condition: 

|27rp + (7iWi + • • • + > c ([<71 1 + • • • + |g„|) (14.9.4) 

for all q G 2" — {0} and p G 2, and where c > 0 and j > n are constants. 

Theorem 14.9.2 For analytic f, and for e sufficiently small, the perturbed 
map f possesses analytic invariant n-tori for all to satisfying (If. 9. 4). 
Moreover, the dynamics restricted to the invariant n-tori is analytically 
conjugate to the rigid rotation 4>-\- ui. 

Proof: See Kuksin and Poschel [1994]. □ 



14.10 Generic Properties of Hamiltonian 
Dynamical Systems 

The special structure of Hamiltonian dynamical systems (both continuous 
and discrete time) give rise to additional generic properties than those 
described in the usual Kupka-Smale theorems discussed in Chapter 12. 
The classic papers on this subject are Robinson [1970a, b], Takens [1970], 
[1972], Newhouse [1977], and Pugh and Robinson [1983]. Here we content 
ourselves with just stating a few of these properties, which we will expand 
upon when we learn more about global dynamics later in the book. 

For Hamiltonian systems the following properties are generic (in the 
sense of holding for a residual subset in the appropriate “space of dynamical 
systems”). 

Hyperbolic periodic orbits are dense in the phase space (Pugh and Robin- 
son [1983]). 

Every hyperbolic periodic orbit has a transverse homoclinic point in any 
neighborhood of any point in the phase space (Taken [1970], [1972], 
Newhouse [1977]). 
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The reader should take great care in applying these generic results to 
specific dynamical systems following the caveats described in Chapter 12. 
In a measure theoretic sense, residual sets can be “small”. Herman [1991] 
has constructed an explicit example of a Hamiltonian system with no peri- 
odic orbits. Another important point for applications is that “generic type 
theorems” are typically proved in a C” setting. In many applications the 
dynamical systems are analytic. The C"" techniques generally make the 
results inapplicable in an analytic setting. 

14.11 Exercises 

1. Prove that (•, J-) is a symplectic form on 

2. Suppose a 2n X 2n matrix of real numbers A satisfies 

A'^JA = J, 

where A has the form 




and a, b, c, d are n X n matrices. Show that 

(a) The n X n matrices c and d are symmetric, 

(b) d — c^b — id, 

(c) detA = 1. 

3. Prove that the composition of two symplectic transformations is a symplectic trans- 
formation. 

4. Consider a (real), canonical autonomous Hamiltonian vector field having an equilib- 
rium point. 

(a) Can the equilibrium point be asymptotically stable? 

(b) Can the equilibrium point have an odd dimensional center manifold? 

5. Consider a (real), symplectic map having a periodic orbit. 

(a) Can the periodic orbit be asymptotically stable? 

(b) Can the periodic orbit have an odd dimensional center manifold? 

6. Prove that the flow generated by a Hamiltonian vector field is volume preserving. 

7. Prove that symplectic maps preserve volume. 

8. Prove that two dimensional volume preserving vector fields are Hamiltonian. 

9. Construct an example of a vector field in dimension 2n, n > 2, that is volume preserv- 
ing but not Hamiltonian. 

10. Consider a Hamiltonian vector field in the plane having a hyperbolic equlibrium point 
connected to itself by a homoclinic orbit. Is the homoclinic orbit structurally stable? 
Consider a heteroclinic connection between two hyperbolic equlibrium points. Is this 
situation structurally stable? Do the results change if the equilibria are not hyperbolic? 
Do the results hold in higher dimensions? 
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11. Consider the following vector field in complex coordinates Zz — +zyi, 22 = ^2 -\-W 2 ' 
Zl — i[—{o' — f3)zi + Azi + TZiZ^Zi + 772212222 + '^z'ZyZ^], 

22 = f[— + &)^2 + A 22 + 77122 22 + '^2Z\Z‘2.Z\ + 773222^]. 

This equation is a normal form that arises in the study of a variety of two degree-of- 
freedom parametrically forced mechanical systems (see Feng and Wiggins [1993]). The 
parameters represent 

• a — difference between the forcing frequency and the sum of the two natural 
frequencies. 

• (3 — difference between the two natural frequencies. 

• A — amplitude of the sinusoidal excitation. 

• 771 , 772 , 773 — coefficients of the nonlinear terms depending on the mode numbers 
and geometrical properties. 

(a) Show that this vector field is Hamiltonian. 

(b) Show that through rescaling we can set 

A ^ 1, 771 ^ -1, 

and that it the rescaled cartesian coordinates the vector field is given by 

Xi = (<T - ttiE + l)y^ - 2 t73Lx 2 - 0yi + Cyi(xl + y^) 

y'l = (- 0 - + ttiE + l)xi - 27r3Lj/2 + 0xi - Cxi(x‘l + j/j) 

X2 = (<T - ttiE + 1)1/2 + 2tt3Lxi + 0y2 + Cy^ix^ + y,) 
y'2 = (-0- + ttiE + 1)0:2 + 27 r 3 Lyi - 0X2 - Cx2(x\ + y,) 

(14.11.1) 



where 

n 2 , 2 , 2 , 2 

£ = Ki + yi + + y2 , 

L = xiy2 - X2yi, 

C — TTi — 7T2 — 7T3. 

The parameters are now reduced to 

d, (T, /3, C, 773. 

(c) Show that (14.11.1) is Hamiltonian with Hamiltonian given by 

TT T-- I ^ T-.2 , r2 , 1/ 2 , 2 2 2s 

H ^ -E + -E + 773 L + -( 7/1 + ^2 - - ^ 2 ) 

I / 2 , 2 2 2s , C” / 2 , 2s / 2 , 2s 

+ 2 ^^2 + 2/2 - - Vl) + ^( 2^1 + yi){^2 + V2)- 

(d) Consider the transformation 

x\ — q\ cos q 2 , 

X 2 — qi sing 2 , 

yi ^ Pi cos 0-2 - p 2 qT^ sinq 2 , 
y2 — Pi sing2 + P2qT^ ^2, 



(14.11.2) 
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with inverse 

qi = ^Jxl+x^, 

xiyi + X 2 V 2 

Pi = , 

^xl+xl 

. -1 *2 
q 2 — tan — , 
xi 

P 2 = XIU 2 - X 2 yi. (14.11.3) 

Show that this transformation is canonical. 

(e) Show that in this new coordinate system the vector field is Hamiltonian with 
Hamiltonian given by 

H = Ho + eHi = -E + -E + ti 3P2 + ~{Pi + P2h ~ h) 

+ (14.11.4) 

where 

i=n 2 , 2 , 2 -2 

E = q-o + Pi + P2«l > 

D = 2pip2qi^ sin2g2 + {P2<li^ ~ <li ~ Pi) cos 2(j2. 

(f) Show that for /3 — C — 0 this system is completely integrable. 

12. Consider the following two degree-of-freedom system in complex coordinates 

-ic + (i|c|^ + i|6|^ - l)c+ i(c6 + be)6 = 0, 

-ii) + (i|c|^ + 5|6|^ - (1 + fc^))b + i(c6 + 6c)c = 0, 

where is a real parameter. This equation arises in the study of a two-mode truncation 
of the nonlinear Schrodinger equation, see Bishop et al. [1990] and Kovacic and Wiggins 
[1992], 

(a) Show that this system is Hamiltonian. 

(b) Show that the coordinate transformation 

c — |c|e*^, b — {x iy)e^^ , 

is canonical, and be careful to describe where it is defined. 

(c) Show that in these new coordinates the vector field takes the form 

X = -k^v - -x"^y + 

4 4 

y — {k — 2I)x + -^x^ -\- —xy , 

/ - 0, 

e = 1- I - x"^, (14.11.5) 

and it is Hamiltonian with Hamiltonian given by 

H = i/" - / - ^x'^ - + (/ - (14.11.6) 



(d) Show that the system is completely integrable. 
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(e) Sketch the phase portrait of the x — y component of (14.11.5) as a function of I 
and k. 

13. Consider the following three degree-of-freedom Hamiltonian given in complex coordi- 
nates 

He — - (1 -I- ed) l^il^ -I- |^2|^ + e — R-e (14.11.7) 

where d, e, and a are real parameters. This Hamiltonian arises in the study of 
dynamics near an elliptic equilibrium point in 1 : 2 : 2 resonance, see Haller and 
Wiggins [1996]. 

(a) Show that it is completely integrable with integrals given by 

Ha, Jl = + |^2|^ J2=^k3l^ (14.11.8) 

(b) We define the following coordinate transformations: 

= k = 1,2,3, (14.11.9) 

■01 = 01 , Ki — Ii + 2/2 + ^^ 3 -^ 3 , 

02 — 03 — ^301, -^2 — I 3 , (14.11.10) 

Xi — y/2l2 sin (02 — 20i), X2 — V 2 T 2 COS (02 — 20i), 

Composing these two transformations gives us a map 

(2, z) (x, /sT, 'tp) . 

Show that the resulting transformation is canonical. 

(c) Show that in the {x, K, 0) coordinates the Hamiltonian takes the form 



Hc{x, K.'tp) — Ki e{d-\- ax 2 ) (Ki — UJ 3 K 2 — \x\ ). 

(d) Show that the vector field corresponding to He takes the form 

xi — e[a{Ki — uj^K 2 — x^ — x^) — 2 (d ax 2 ) 2 : 2 ], 
X 2 — ^ 2 xi (d 4- ax 2 ) , 

H 2 - 0, 

02 = —ecos (d -I- ax 2 ) , 

Hi - 0, 

01 = 1 -I- € (d -I- ax 2 ) ■ 



(14.11.11) 



(14.11.12) 



(e) Sketch the phase portrait of the xi — X 2 component of (14.11.12) as a function 
of Ki — (jJ^K 2 and d. 

14. The following two degree-of-freedom Hamiltonian is the normal form (through third 
order) describing the behavior near an elliptic equilibrium point in 1 : 2 resonance: 

^^(-31,^1, 22, ^ 2 ) — + 1 22 !^ 4-2a Re 2 ^ ^2 ~ 26 Im 2 ^ 22 , (14. 11.13) 



where a and h are real. Let 



c = a -1- z6, c = |c|e 



and consider the coordinate transformation 



— I arg c x arg c 

(21 , 2^ 22 , 22 ) !->■ ( e 2 2i , e 2 2i , 22 , 22 



Show that this transformation is canonical, and that the Hamiltonian becomes 



7/(21, 21, 32 , 22) = - |2if + |22r + 2 |c| Re 2^22. 



(14.11.14) 
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15. Show that the Poincare map constructed in section 10.4 is symplectic. 

16. Consider the following frequency vectors 

(a) a; — 0. 

(b) a; — 1. 

(c) a; = (1,0). 

(d) uj = (v^.O). 

(e) uj = (\/2, 1). 

(f) UJ = (y/2,0,0). 

(g) iu = (V2,0,l). 

(h) UJ = (V2, 1, 1). 

(i) UJ = (V2, V2. 1). 

(j) UJ = (V2.V2.V2). 

(k) UJ = (V2, 2, 1). 

(l) UJ = (1.1.1). 

(m) ij — (1,2,1). 

(n) cj ^ (1,2,3). 

(o) oj ^ (1,2,8). 

Which of these is resonant? For the the resonant frequency vectors, compute the mul- 
tiplicity of the resonance. 

17. Compare the Liouville-Arnold Theorem (Theorem 14.5.1)with the results obtained in 
Exercise 20 of Chapter 13. 
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In this chapter we consider vector fields that are given by the gradient 
of a scalar valued function, which are referred to as gradient vector fields. 
Gradient vector fields arise in a variety of applications. For example, they 
arise in the study of neural networks (Haykin [1994]) and in the study of 
electrical circuits and networks (see Hirsch and Smale [1974]). 

Consider a vector field of the form 

x = -VV{x), ccGM”, (15.0.1) 

where V(x) is a scalar valued function on R" that is C"", r > 2. The minus 
sign in front of the gradient is traditional and imposes no restriction as we 
can always redefine V(x) as —V(x). The special structure of this vector 
field, i.e., the fact that it is a gradient of a scalar valued function, imposes 
strict constraints on the nature of the dynamics, as we shall see. 

First, it should be clear that equilibrium points of (15.0.1) are relative 
extrema of V(x). Moreover, at any point except for an equilibrium point, 
the vector field (15.0.1) is perpendicular to the level sets of V{x). We can 
get more information by differentiating V(x) along trajectories of (15.0.1), 
i.e.. 



V(x) = VV(x) ■ X, 

= VV{x) ■ (-VG(x)) , 

= -\VV{x)\\ (15.0.2) 

where is the Euclidean inner product and | ■ | is the induced Euclidean 
norm. The next result follows immediately from this calculation. 

Proposition 15.0.1 V{x) <0 and V{x) = 0 if and only if x is an equi- 
librium point of (15.0.1). 

Using V (x) to construct an appropriate Liapunov function, we can obtain 
stability information about certain equilibrium points. 

Proposition 15.0.2 Suppose x is an isolated minimum ofV{x), i.e., there 
is a neighborhood of x that contains no other minima ofV{x). Then x is 
an asymptotically stable equilibrium point of (15.0.1). 
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Proof: A simple calculation shows that V{x) — V (x) is an appropriate Lia- 
punov function from which the result follows by applying Theorem 2.0.1. 

□ 

It is also easy to see that the matrix associated with the linearization 
about an equilibrium point of a gradient vector field can have only real 
eigenvalues. This follows from the fact that —V'^V{x) is a symmetric ma- 
trix, and symmetric matrices have only real eigenvalues. 

Trajectories of gradient dynamical systems have very simple asymptotic 
behavior, as the following theorem shows. 

Theorem 15.0.3 Suppose x is an to limit point of a trajectory of (15.0.1). 
Then x is an equilibrium point of (15.0.1). 

Proof: If X is an omega limit point of a trajectory, say 4>t{x), then we 
can show that V (x) is constant along the trajectory through x. Hence, 
V{x) = 0. It then follows from (15.0.2) that x is an equilibrium point. 

So we show that V{x) is constant along the trajectory through x. By 
the definition of omega limit point, there exists a sequence {ti}, ti — >■ oo as 
i — >■ oo such that 

lim (jti (x) = X. 
i—¥oo 

Let X = V{x), then x is the greatest lower bound of the set {V{(j)t{x)) \t > 
0}. This follows from the fact that V{x) decreases along trajectories (hence 
V{4>n{x)) > V{(f)t{x)) > V{4>ti^i{x)) for ti < t < ti+i) and by the continu- 
ity of V{x). From Proposition 8.1.3 the omega limit set of a trajectory is 
invariant, hence 4>t{x) is also an omega limit point of 4>t{x). Then, since x 
is the greatest lower bound of the set {V{(j)t{x)) 1 1 > 0}, V{<pt{x)) = X- bl 
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1. Show that gradient vector fields cannot have: 

(a) periodic orbits, 

(b) homoclinic orbits, or 

(c) heteroclinic cycles. 

2. Can gradient vector fields have heteroclinic orbits? 

3. Give sufficient conditions on V{x) so that the trajectories of (15.0.1) exist for all time. 

4. Describe the time evolution of the volume of a region of the phase space under the 
flow generated by a gradient vector field. 

5. Consider the vector field 

— u>i w sin (^2 ~ ^i); 

^2 — u>2 + u> sin{^i — ^2), ( 9 i, 62) ^ X , 



where u;i, uj 2 , and w are parameters. 
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Show that this is a gradient vector field. 

Determine the phase portrait for this vector field. 

Can this vector field have periodic orbits? Would the existence of periodic orbits 
contradict Theorem 15.0.3? 
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In this section we introduce the notion of reversible dynamical systems. 
Another example of a class of dynamical systems with “special structure” 
that constrains the dynamics. Such systems often arise in applications (see 
Roberts and Quispel [1992] for many examples, as well as the exercises at 
the end of this chapter) and may have the seemingly peculiar property of 
simultaneously possessing Hamiltonian-like dynamics (e.g. KAM tori) and 
dissipative dynamics (e.g. attractors). We begin by defining reversibility in 
both the continuous and discrete time settings. Comprehensive references 
are Sevryuk [1986], [1992], and Roberts and Quispel [1992]. 



16.1 The Definition of Reversible Dynamical 
Systems 

Consider the following continuous and discrete time dynamical systems: 
x = f{x), Xn+l=g{Xn), XGK”, 
which we assume to be C”, r > 1. Next consider a C"" map 

G : M" R”, 



satisfying 



GoG = id, 



where id stands for the identity map. G is called an involution. Then a 
vector field is said to be reversible if 



±{G{x)) = -f{G{x)). (16.1.1) 

From this definition we can see the motivation for the term reversible. A 
vector field, x = f{x), is reversible if the dynamics on the phase space 
G • K” is given by the time reversed vector field. Using the chain rule on 
the left hand side of (16.1.1), as well as the identity x = f{x), we see 
that the definition of reversibility for a vector field reduces to the following 
relationship between G and /: 



DG-f = -foG, 



(16.1.2) 
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where in this section the notation means ordinary matrix multiplication, 
with a matrix to the left of • and a vector to the right of •. 

Next we consider maps. A map is said to be reversible if 

g{G{xi+i)) = G{xi). (16.1.3) 

The motivation for the term reversible should be apparent. In the trans- 
formed phase space, G-M”, g reverses the temporal direction of trajectories. 
Substituting Xi+\ = g{xi) into the left hand side of (16.1.3), we obtain the 
following relationship between G and g that is required for g to be re- 
versible: 

goGog = G. (16.1.4) 



16.2 Examples of Reversible Dynamical Systems 



We now consider some explicit examples of reversible dynamical systems. 

Example 16.2.1. Consider a canonical Hamiltonian vector field where the 
Hamiltonian is even in the momenta, i.e., 

H{q,p) = H{q,-p), (q,p) G ^ x R". 

Hamilton’s equations are given by 



q = 



dH 



p = - 



dH 
dq ' 



(16.2.1) 



It is easy to verify that the map 



G : {q,p) i-S- (q,-p), 

is an involution of and that (16.2.1) satisfies (16.1.1). 



End of Example 16.2.1 



Example 16.2.2. The following example is from Roberts and Quispel [1992]. 
Consider the vector field 



X = x(l — x), X € R. 

This vector field is reversible with respect to the involution 

G : X 1-^ 1 — X. 

Also, this vector field is not Hamiltonian, and it is not volume preserving. It has 
a repelling fixed point at a: = 0 and an attracting fixed point at a: = 1. 



End of Example 16.2.2 
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16.3 Linearization of Reversible Dynamical 
Systems 

Suppose a: = xo is a fixed point. We now consider the structure of reversible 
dynamical systems linearized about fixed points. 



16.3a Continuous Time 
L etting x = xo + C) (16.1.1) becomes 

— (G(a:o + 0) = ~f {G{xo + ?)) ■ 

Taylor expanding each side in ^ gives 

DG{xo + C (io + C) = -/ {G{xo) + DG{xo)^ + 0(1^1")) 

= -/ (G(xo)) - Df (G(xo)) DG{xo)^ + O (ICH) ■ 

It follows from (1.15.2) that if xq is a fixed point, then G(xq) is a fixed 
point, and therefore / (G(xo)) = 0. Then this relation becomes 

DG(xoU = -Df (G(xo)) DG{xq)^ 

+ o{m+o{\m\). 

Note that ^ = Df{xo)f, + G(|^P). Substituting this expression into the 
left-hand-side of the equation, and dropping the G(|^P) terms, we obtain 

DG{xo)Df{xo) = -Df (G(xo)) DG(xo). (16.3.1) 

Suppose G(xq) = Xq- Then xg is said to be a symmetric fixed point. For 
symmetric fixed points (16.3.1) has the form 

DG{xg)Df{xg) = -Df{xo)DG{xg), (16.3.2) 

and the matrix Df{xg) is said to be infinitesimally reversible. In general, 
we have the following definition. 

Definition 16.3.1 (Infinitesimally Reversible) Suppose G : K" M” 
is a linear involution and A : K" — >■ M" is a linear map. Then A is said to 
be infinitesimally reversible if 



AG + GA = 0. 



Note that this linearization condition is defined at a symmetric fixed point. 
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Note that we have not proven that if G is an involution, so is DG. This is 
true for DG evaluated at a symmetric fixed point, as can be easily verified 
with a simple calculation using the definition of involution and the chain 
rule that we leave to the reader. 

Sevryuk [1986], [1992] has studied the spectra of infinitesimally reversible 
linear maps. It can be shown that if A G C is an eigenvalue of an infinitesi- 
mally reversible linear map then so is —A. Moreover, all nonzero eigenvalues 
occur in real pairs (x, —x), a; G M, purely imaginary pairs, (iy, —iy), y G K, 
and quadruplets, ±x±iy. This is made precise in the following proposition 
(courtesy of Jerry Marsden). 

Proposition 16.3.2 Suppose A is an nxn infinitesimally reversible linear 
matrix. Then the characteristic polynomial p(A) of A satisfies 

p{-X) = (-l)XA), A G C. 

In particular, if X is an eigenvalue of A so is —A. If A is nonsingular, then 
n is even. In any case, the spectrum of A is symmetric with respect to the 
real and imaginary axes. 

Proof: By definition 16.3.1 we have AG + GA = 0, which implies 

GAG~^ = -A, 

and p(X) = det (A — Al), where 1 denotes the nxn identity matrix (note 
that G~^ = G since G is an involution). Using these two equations, we 
obtain 



p{X) = det {A — Al) 

= det (G(A- Al)G-i) 

= det {GAG~^ - Al) 

= det {-A - Al) 

= (-l)”det (A-b Al) 

= (-l)>(-A). 

It follows immediately that the spectrum of A is symmetric with respect 
to the real and imaginary axes. Moreover, if n is odd, then p is an odd 
function of A, and so 0 is an eigenvalue. Then if n is odd, A cannot be 
invertible. □ 

Note that the proof of this proposition is identical to the proof of Propo- 
sition 14.3.5 which described the eigenvalues of infinitesimally symplectic 
matrices. 
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16.3b Discrete Time 

We carry out the same procedure for maps as we carried out for vector 
fields above. Substituting a; = a;o + ^ into (16. 1.3) 

g o G{xo + Ci+i) = G{xo + ^i), 

and Taylor expanding in ^ gives 

g o G(xo) + Dg{G{xo)) ■ DG{xo)^,+i + D(|e*+iP) 

= G{xo) + DG{xo)^, + 0{\^i\^). 

If we substitute the following expression for into this expression 

^i+i = Dg{xo)^i, 

then the terms of order give us the following relation: 

Dg (G(xo)) DG{xo)Dg{xo) = DG(xo). (16.3.3) 

If xq is a symmetric fixed point (16.3.3) becomes 

Dg{xo)DG{xo)Dg{xo) = DG{xq), (16.3.4) 

and the matrix Dg{xo) is said to be reversible. In general, we have the 
following definition. 

Definition 16.3.3 (Reversible) Suppose G : K" — >■ K" is a linear invo- 
lution and A : K" — >■ R" is a linear map. Then A is said to be reversible 

if 

AG A = G. 

Note that this linearization condition is defined at a symmetric fixed point. 



Sevryuk [1986], [1992] has also studied the spectra of reversible linear 
maps. It can be shown that if A G C is an eigenvalue of a reversible linear 
map then so is Moreover, all eigenvalues different from 1 and —1 occur in 
real pairs {x, ^), a; G M, unitary pairs, (e**^,e“*^), y G K, and quadruplets, 
x±^^±^y ^ We prove this in the following proposition. 

Proposition 16.3.4 Suppose A : R" — >■ R” is a linear operator that is 
reversible with respect to the linear involution G : R” — >■ R" . Then if A 
is an eigenvalue of A so is A, and j. If A does not have 1 and —1 as 
eigenvalues, then n is even. 
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Proof: First, note that it follows from the relation AG A = G that det A = 
±1. Hence, 0 is not an eigenvalue of A. Moreover, AG A = G is equivalent 
to A = GA~^G, where we have used the fact that G is an involution, i.e., 
G = G~^. Using these results, we have 

p(A) = det {A — Al), 

= det (GH-^G- All) , 

= det (G (H-i - Al) G) , 

= det {A~^ — Al) , 

= det (H-i(l-AH)), 

= det A~^ det (1 — AH) , 

= ±1 det A 

= ±l(A»(i). 

Hence, it immediately follows that if A is an eigenvalue of H so is y. Since 
A is real, it also follows that A and i are also eigenvalues. From this is 
follows that if 1 and —1 are not eigenvalues, the eigenvalues occur in pairs 
or quartets. Hence n must be even. □ 




16.4 Additional Properties of Reversible 
Dynamical Systems 

We end this section with a few general remarks about reversible dynamical 
systems. Additional properties are developed in the exercises. 

1. We saw that near symmetric fixed points the spectra of the associ- 
cated linearized dynamical systems is very similar to that of Hamil- 
tonian dynamical systems. There are many more similarities between 
reversible dynamical systems and Hamiltonian systems. An extensive 
outline, with references, can be found in Sevryuk [1991]. Most no- 
tably, there is a KAM theory for reversible dynamical systems (but, 
interestingly enough, there is no analog of Nekhoroshev’s theorem) . 

2. Despite the similarities between reversible and Hamiltonian dynam- 
ical systems, reversible dynamical systems can simultaneously dis- 
play non-Hamiltonian behavior (odd-dimensionality notwithstand- 
ing). For example, numerical simulations of dynamical systems that 
display KAM- type behavior, as well as possess attractors and re- 
pellers, can be found in Roberts and Quispel [1992] and Politi et al. 
[1986]. 
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16.5 Exercises 

Exercises 1-4 are examples given in the review paper of Roberts and Quispel [1992]. 

1. The following vector field arises in laser physics (Arecchi [1987]): 

X — zx y Cl , 
y ^ zy - X, 

z — C 2 — x^ — {x, y, z) G 



where Ci and C 2 are parameters. Prove that this is a reversible vector field. Is the 
vector field volume-preserving? 

2. The following vector field arises in the study of non-equilibrium thermodynamics 
(Hoover et al. [1987]): 



X ^ y, 

y = F — e sin x — zy, 

z — a{y^ — 1), {x, y, z) G 



where F, a, and e are parameters. Prove that this is a reversible vector field. 

3. The following vector field is an idealized model describing the sedimentation of small 
particles in a fiuid (see Golubitsky et al. [1991]): 

n-2 

n ^ ^ [U{ri+i H h n+fc) - H(n_i H h n_fc)] , i ^ 1, . . . , n, 

fc=i 



where 



U(r) = -j-r + — 

k kk 



the indices are taken mod n, ri , . . . , denote the consecutive edges of an n- 

sided polygon in and is a unit vector in the z direction (in which gravity is 
acting). Prove that this is a volume-preserving, reversible vector field. 

4. The following vector field models a series array of Josephson junctions in parallel with 
a single resistor (Tsang et al. [1991]): 



1 

Ok = n -|- a cos 9k -\ \ cos 9i. k — 1, . . . , N . 

N “ 

where 9k are the phase angles (defined mod 27 t), and Q, and a are parameters. Prove 
that this is a reversible vector field. 

5 

6 

H oG, H o H ^id. 



Suppose G : R”" is a C’’, t > 1, involution. Prove that at a symmetric fixed 

point DG is also an involution. 

Suppose a map g : —¥ R”' is reversible with respect to the involution G : R^ —¥ R”". 

Then show that g can be written as the product of two involutions, i.e., 



7. Suppose a map g : R^ —¥ R”' is reversible with respect to the involution G : R^ —¥ R”". 
Then show that g is invertible. 
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8 . Consider the so-called Chirikov-Taylor or Standard map: 

X — X y, 

/ ^ ^ 2 

y — y sin 27TX , [x^ y) G K , 

27T 



where K is a, parameter. Show that it can be written as the product of two involutions 

H oG. 

9. Consider the following area-preserving Henon map: 

x' ^ y, 

y' = -X + 2Cy + 2y"^ , (a;,y)eR^, 

where C is a parameter. Show that it can be written as the product of two involutions 
H oG. 

10. Suppose a map g : R”" is reversible with respect to the involution G : —¥ R”". 

Suppose r C R”^ is an invariant set for g, i.e., y(r) = F. Then show that G(F) is also 
an invariant set for g. 

11 . Suppose a map g : R^ —¥ R^ is reversible with respect to the involution G : R”^ —¥ 

R^. Let a^o be a hyperbolic saddle point for g with stable and unstable manifolds 
denoted W^{xo) and W^{xq), respectively. Show that G(W^(xo)) = (G(xo)) and 

G(W'^{xo)) = W‘- (G{xo)) 

12 . Suppose a map g : R^ —¥ R”" is reversible with respect to the involution G : R^ —¥ R”". 
Let xq be a hyperbolic saddle point for g with stable and unstable manifolds denoted 
W^{xq) and W^{xq), respectively. Let Fix(G) denote the fixed point set of G and 
suppose W^{xq) intersects Fix(G). If G(xo) = xq show that the intersection point is a 
homoclinic point and if G{xq) 7 ^ xq show that the intersection point is a heteroclinic 
point. 

13. Suppose a map g : R^ R”" is reversible with respect to the involution G : R^ R”". 
Show that reversibility is preserved under conjugacy. 

14. Suppose the , r > 1 vector field x — f{x), x G R^, is reversible with respect to 
the involution G : R^ R”". Let 4>t{') denote the fiow generated by this vector field. 
Show that 4>t{') is a one-parameter family of reversible diffeomorphisms. 

15. Show that a linear map represented by a symplectic matrix is reversible. 

16. Show that a linear map represented by an infinitesimally symplectic matrix is infinites- 
imally reversible. 
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Asymptotically Autonomous 
Vector Fields 



Imagine a system that is subjected to some type of time-dependent distur- 
bance that “dies away” in time. Mathematically, this might be represented 
by a nonautonomous vector field that becomes autonomous as time in- 
creases. Now many dynamical properties are only defined in the limit as 
t — >■ oo. Therefore, in order to study the long time behavior of such a system 
would it be sufficient to study the long time behavior of the autonomous 
vector field obtained in the limit as t — >■ oo? This is one of the questions we 
will address in our discussion in this chapter of asymptotically autonomous 
vector fields, which we now define more precisely. 

Consider the following nonautonomous vector field 

x = f{x,t), x G K", (17.0.1) 

which we assume to be C”, r > 1 (this can be weakened to continuity 
in t, and locally Lipschitz in x). Suppose that (17.0.1) is asymptotically 
autonomous in the sense that 



f{x,t)^g{x), t^oo, 



where the convergence is locally uniform in a; G K", i.e., convergence occurs 
for any x in any compact subset of M". We denote the autonomous limit 
equation by 

x = g{x), xGK”. (17.0.2) 

For both (17.0.1) and (17.0.2) we assume that the trajectory through any 
point x G K" exists for all positive time. 

It is natural to conjecture that the asymptotic behavior of trajectories 
of (17.0.1) can be inferred from the asymptotic behavior of trajectories of 
(17.0.2). The first results along these lines were obtained by Markus [1956]. 
We summarize his results, but first we begin with a definition. 



Definition 17.0.1 (w Limit Set of A Nonautonomous System) 

Suppose x{t,to,Xo) is the solution of (17.0.1) satisfying x(to, tg, xq) = xq. 
A point y is said to be an uj limit point of x{t,to,xo), denoted oj(xo,to), if 
there exists a sequence of times {tj}, tj — >■ oo as j — >■ oo, such that 



lim 

j->oo 



x{tj,to,xo) = y. 
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The set of all to limit points of a given trajectory is called the omega limit 
set of the trajectory. The set of all uj limit sets for all trajectories is called 
the omega limit set of the system. 

We now state Markus’s main results on this problem. 

Theorem 17.0.2 (Markus) The lj limit set uj of a trajectory x{t,to,xo) 
of (17.0.1) is non-empty, compact, and connected, uj attracts x{t,tQ,XQ) in 
the sense that 

dist (x(t,to,xo),uj) ^ 0, t — >■ oo, 

where dist{-,-) is a metric onM". Moreover, uj is invariant under (17.0.2). 

Proof: See Markus [1956]. □ 

Note that this result does not imply that uj limit sets of (17.0.1) are 
unions of u> limit sets of (17.0.2). Thieme [1994] gives several examples 
showing that this is not the case. 

Theorem 17.0.3 (Markus) Suppose x is an asymptotically stable equi- 
librium of (17.0.2) and letuo denote theuj limit set of a trajectory x(t, to, xq) 
of (17.0.1). If UJ contains a point x such that a trajectory of (17.0.2) start- 
ing at X converges to x as t ^ go then u = x, i.e., x{t,to,xo) x as 
t — >■ oo. 

Proof: See Markus [1956]. □ 

If one restricts attention to two-dimensional systems, then an analog of 
the Poincare-Bendixson theorem is available. 

Theorem 17.0.4 (Markus) Suppose n = 2, and let uj denote the uj limit 
set of a trajectory x(t,to,xo) of (17.0.1). Then uj contains at least one 
equilibrium of (17.0.2), or uj is the union of periodic orbits of (17.0.2). 

Proof: See Markus [1956]. □ 

Thieme [1992], [1994] has strengthened this result. Moreover, his paper 
describes a variety of applications where asymptotically autonomous equa- 
tions arise. 

Theorem 17.0.5 (Thieme) Let n = 2, and let uj denote the uj limit set of 
a trajectory x(t,to,xo) of (17.0.1). Assume that there exists a neighborhood 
of UJ which contains at most finitely many equilibria of (17.0.2). Then one 
of the following holds. 

1. UJ consists of an equilibrium point of (17.0.2). 

2. UJ is the union of periodic orbits of (17.0.2), and possibly of center 
type equilibria of (17.0.2) that are surrounded by periodic orbits of 
(17.0.2) lying in uj. 




244 



17. Asymptotically Autonomous Vector Fields 



3. ui consists of equilibria of (17.0.2) that are cyclically connected to 
each other in a heteroclinic cycle. In the case of only one equilibrium 
it would be eonnected by a homoclinic orbit. 

Proof: See Thieme [1992]. □ 

Further information on asymptotically autonomous systems can be found 
in Thieme [1994], as well as the original work of Markus [1956]. Thieme 
[1994] contains a number of examples that appear to be somewhat counter- 
intuitive. Holmes and Stuart [1992] study the existence of homoclinic orbits 
in asymptotically autonomous vector fields. Asymptotically autonomous 
equations also naturally arise in the study of the approach of trajecto- 
ries to a low dimensional invariant manifold in autonomous systems, see 
Robinson [1996]. 



17.1 Exercises 



1. Consider the following asymptotically autonomous vector field in the plane: 

X ^ y, 

y — X — — Sy -\- 7 e“*, {x, y) G M.^, (5, 7 > 0. 



Describe the uj limit sets for trajectories. 

2. This example is from Thieme [1994]. Consider the following planar vector field 

X ^ {-X (1 - a:) + y) (2 + a:) , 

y^-y. (17.1.1) 



The y component of the vector field is independent of the x component and the solution 
of this can be solved and substituted into the x component to yield the following 
asymptotically autonomous system 



X — ^ — a: (1 — a:) + yoe (2 + a:) , (17.1.2) 

and the asymptotically autonomous limit of this equation is 

X — —X (1 — a:) (2 + a:) . (17.1.3) 

(a) Show that a? = 0 is an attracting equilibrium of (17.1.3), and that the basin of 
attraction is the open interval between a: = 1 and x — —2. 

(b) For (17.1.2), show that the basin of attraction of a: = 0 becomes arbitrarily small 
as yo is chosen arbitrarily large. 
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Center Manifolds 



When one thinks of simplifying dynamical systems, two approaches come to 
mind: one, reduce the dimensionality of the system and two, eliminate the 
nonlinearity. Two rigorous mathematical techniques that allow substantial 
progress along both lines of approach are center manifold theory and the 
method of normal forms. These techniques are the most important, gener- 
ally applicable methods available in the local theory of dynamical systems, 
and they will form the foundation of our development of bifurcation theory 
in Chapters 20 and 21. 

The center manifold theorem in finite dimensions can be traced to the 
work of Pliss [1964], Sositaisvili [1975], and Kelley [1967]. Additional valu- 
able references are Guckenheimer and Holmes [1983], Hassard, Kazarinoff, 
and Wan [1980], Marsden and McCracken [1976], Carr [1981], Henry [1981], 
and Sijbrand [1985]. 

The method of normal forms can be traced to the Ph.D thesis of Poincare 
[1929]. The books by van der Meer [1985] and Bryuno [1989] give valuable 
historical background. 

Let us begin our discussion of center manifold theory with some motiva- 
tion. Consider the linear systems 

x = Ax, (18.0.1) 

XI — >Ax, xGK", (18.0.2) 

where A is an n x n matrix. Recall from Chapter 3 that each system has in- 
variant subspaces A®, A“, corresponding to the span of the generalized 
eigenvectors, which in turn correspond to eigenvalues having 

Flows: negative real part, positive real part, and zero real part, respec- 
tively. 

Maps: modulus < 1, modulus > 1, and modulus = 1, respectively. 

The subspaces were so named because orbits starting in A® decayed to zero 
as t (resp. n for maps) f oo, orbits starting in became unbounded as t 
(resp. n for maps) f oo, and orbits starting in E‘^ neither grew nor decayed 
exponentially as t (resp. n for maps) f oo. 

If we suppose that = 0, then we find that any orbit will rapidly decay 
to Thus, if we are interested in long-time behavior (i.e., stability) we 
need only to investigate the system restricted to 
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It would be nice if a similar type of “reduction principle” applied to the 
study of the stability of nonhyperbolic fixed points of nonlinear vector fields 
and maps, namely, that there were an invariant center manifold passing 
through the fixed point to which the system could be restricted in order to 
study its asymptotic behavior in the neighborhood of the fixed point. That 
this is the case is the content of the center manifold theory. 



18.1 Center Manifolds for Vector Fields 

We will begin by considering center manifolds for vector fields. The set-up 
is as follows. We consider vector fields of the following form 

x = Ax + f{x,y), 

y=By + g{x,y), {x,y)GR‘^xR% (18.1.1) 

where 

/(0,0) = 0, A/(0,0) = 0, 

5(0,0) = 0, Z?5(0,0) = 0. (18.1.2) 

(See Chapter 3 for a discussion of how a general vector field is transformed 
to the form of (18.1.1) in the neighborhood of a fixed point.) 

In the above, ^ is a c x c matrix having eigenvalues with zero real parts, 
i? is an s X s matrix having eigenvalues with negative real parts, and / and 
g are C” functions (r > 2). 

Definition 18.1.1 (Center Manifold) An invariant manifold will he 
called a center manifold for (18.1.1) if it can locally be represented as fol- 
lows 

W^(0) = {{x,y) xW \ y = h{x), |a;| < ,5, /i(0) = 0, Dh{0) = 0 } 
for 6 sufficiently small. 

We remark that the conditions h{0) = 0 and Dh{0) = 0 imply that 
W‘^{0) is tangent to E‘^ at (x,y) = (0,0). The following three theorems are 
taken from the excellent book by Carr [1981]. 

The first result on center manifolds is an existence theorem. 

Theorem 18.1.2 (Existence) There exists a C” center manifold for 
(18.1.1). The dynamics of (18.1.1) restricted to the center manifold is, 
for u sufficiently small, given by the following c-dimensional vector field 

u = Au-\- f{u,h{u)), M G M'^. (18.1.3) 
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Proof: See Carr [1981]. □ 



The “u” Notation. Since the center manifold of an equilibrium point 
is locally represented as a graph, i.e., y = h(x), the reader may be 
wondering why we substituted u for x in the restriction of the vector field 
to the center manifold given in (18.1.3). This is to emphasize that the 
restriction of the vector field to the center manifold is, generally, a vector 
field on a nonlinear surface. If we had used x, since {x, y) G x are 
the original coordinates for the vector field, this point might have been 
be obscured. Once this point of interpretation is understood, there is 
no harm in using x (or, for that matter, any other symbol), and this is 
typically done in the literature. 



The next result implies that the dynamics of (18.1.3) near u = 0 deter- 
mine the dynamics of (18.1.1) near {x,y) = (0,0). 

Theorem 18.1.3 (Stability) i) Suppose the zero solution of (18.1.3) is 
stable (asymptotically stable) (unstable); then the zero solution of (18.1.1) 
is also stable (asymptotically stable) (unstable), ii) Suppose the zero solu- 
tion of (18.1.3) is stable. Then if {x(f),y{tf) is a solution of (18.1.1) with 
(x(0),y(0)) sufficiently small, there is a solution u(t) of (18.1.3) such that 
as t ^ CO 



x{f) = u{t) -h 0(e 
y{t) = h{u{t)) 

where 7 > 0 zs o constant. 

Proof: See Carr [1981]. □ 

Dynamics Captured by the Center Manifold 

Stated in words, this theorem says that for initial conditions of the full 
system sufficiently close to the origin, trajectories through them asymp- 
totically approach a trajectory on the center manifold. In particular, equi- 
librium points sufficiently close to the origin, sufficiently small amplitude 
periodic orbits, as well as “small” homoclinic and heteroclinic orbits are 
contained in the center manifold. 

The obvious question now is how do we compute the center manifold so 
that we can reap the benefits of Theorem 18.1.3? To answer this question 
we will derive an equation that h{x) must satisfy in order for its graph to 
be a center manifold for (18.1.1). 

Suppose we have a center manifold 

TT'^(O) = {(x, y) G X M® I y = h{x), |x| < S, h{0) = 0, Dh{0) = 0}, 

(18.1.4) 




248 



18. Center Manifolds 



with S sufficiently small. Using invariance of fU'^(O) under the dynamics 
of (18.1.1), we derive a quasilinear partial differential equation that h(x) 
must satisfy. This is done as follows: 

1. The (x,y) coordinates of any point on fU°(0) must satisfy 

y = h(x). (18.1.5) 

2. Differentiating (18.1.5) with respect to time implies that the (x,y) 
coordinates of any point on fU'^(O) must satisfy 

y = Dh(x)x. (18.1.6) 

3. Any point on fU°(0) obeys the dynamics generated by (18.1.1). There- 
fore, substituting 

X = Ax + f[x,h{x)), (18.1.7) 

y = Bh{x) + g{x,h{x)) (18.1.8) 



into (18.1.6) gives 

Dh{x)\Ax + f {x,h{x)y\ = Bh{x) + g{x,h{x)^ (18.1.9) 



or 



N{h{x)) = Dh{x) '[Ax + /(x, ft^(x))] — Bh{x) — g{x, h{x)) = 0. 

(18.1.10) 

Equation (18.1.10) is a quasilinear partial differential equation that h{x) 
must satisfy in order for its graph to be an invariant center manifold. To 
find a center manifold, all we need do is solve (18.1.10). 

Unfortunately, it is probably more difficult to solve (18.1.10) than our 
original problem; however, the following theorem gives us a method for 
computing an approximate solution of (18.1.10) to any desired degree of 
accuracy. 

Theorem 18.1.4 (Approximation) Let (/> : — >■ M® be a mapping 

with (j){0) = D(j){0) = 0 such that Af[(j){x)) = 0{\x\^) as x 0 for some 
q > 1. Then 

\h(x) — 4){x)\ = 0(|x|*) as X — >■ 0. 

Proof: See Carr [1981]. □ 

This theorem allows us to compute the center manifold to any desired 
degree of accuracy by solving (18.1.10) to the same degree of accuracy. 
For this task, power series expansions will work nicely. Let us consider a 
concrete example. 
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Example 18.1.1. Consider the vector field 

2 5 

X = X y — X , 

y = -y + x'^, {x,y)eM.^. (18.1.11) 

The origin is obviously a fixed point for (18.1.11), and the question we ask is 
whether or not it is stable. The eigenvalues of (18.1.11) linearized about (x,y) = 
(0,0) are 0 and —1. Thus, since the fixed point is not hyperbolic, we cannot 
make any conclusions concerning the stability or instability of {x, y) = (0, 0) 
based on linearization (note: in the linear approximation the origin is stable but 
not asymptotically stable). We will answer the question of stability using center 
manifold theory. 

From Theorem 18.1.2, there exists a center manifold for (18.1.11) which can 
locally be represented as follows 

1T“(0) = { (*, j/) € I y = h{x), \x\ < 5, h{Q) = Dh{Q) = 0 } (18.1.12) 

for 5 sufficiently small. We now want to compute W’^{G). We assume that h{x) 
has the form 

h(x) = ax^ + hx^ + 0{x'*^), (18.1.13) 

and we substitute (18.1.13) into equation (18.1.10), which h(x) must satisfy to 
be a center manifold. We then equate equal powers of x, and in that way we can 
compute h(x) to any desired order of accuracy. In practice, computing only a few 
terms is usually sufficient to answer questions of stability. 

We recall from (18.1.10) that the equation for the center manifold is given by 

M (h{x)) = Dh{x) [Ax + / {x, h{x))] — Bh{x) — g (x, h{x)) = 0, (18.1.14) 

where, in this example, we have {x, y) G 

^ = 0 , 

B = -l, 

f{x,y) = x^y-x^, 

g{x,y) = x^. (18.1.15) 

Substituting (18.1.13) into (18.1.14) and using (18.1.15) gives 

N {h{x)) = [2ax + ibx^ + • • ■)(ax‘^ + bx^ — x® + • • •) 

+ ax^ + bx^ — x^ + ■ ■ ■ = 0. (18.1.16) 

In order for (18.1.16) to hold, the coefficients of each power of x must be zero; 
see Exercise 2. Thus, equating coefficients on each power of x to zero gives 

x^:a — l = 0^a=l, 
x^ : b = 0, 



(18.1.17) 



and we therefore have 



h{x) = + 0(x'^). 



(18.1.18) 
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y 




Using (18.1.18) along with Theorem 18.1.2, the vector field restricted to the center 
manifold is given by 

x = x‘^ + 0{x^). (18.1.19) 

For X sufficiently small, a; = 0 is thns unstable in (2.1.19). Hence, by Theorem 
18.1.2, {x,y) = (0,0) is unstable in (18.1.11); see Fignre 18.1.1 for an illustration 
of the geometry of the flow near {x, y) = (0, 0). 

This example illnstrates an important phenomenon, which we now describe. 

The Failure of the Tangent Space Approximation 

The idea is as follows. Consider (18.1.11). One might expect that the y compo- 
nents of orbits starting near {x, y) = (0, 0) should decay to zero exponentially 
fast. Therefore, the qnestion of stability of the origin should reduce to a study 
of the X component of orbits starting near the origin. One might thus be very 
tempted to set y = 0 in (18.1.11) and study the reduced equation 

x = -x^. (18.1.20) 

This corresponds to approximating 1F“(0) by E‘^. However, a: = 0 is stable for 
(18.1.20) and, therefore, we would arrive at the wrong conclusion that (x,y) = 
(0,0) is stable for (18.1.20). The tangent space approximation might sometimes 
work, but, as this example shows, it does not always do so. 

End of Example 18.1.1 

Example 18.1.2. The previous example showed a situation where an equi- 
librium point was unstable, but the tangent space approximation to its center 
manifold indicated that it was stable. One could ask the following question. “Sup- 
pose the equilibrium point is stable, will the tangent space approximation to the 
center manifold also show stability”? Here we give an example showing that the 
answer is “no”. 
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Consider the vector field 

X = —xy — a;®, 

y = -y + x'^, {x,y)€R'^. 

The origin is an equilibrium point, and the eigenvalues of the matrix associated 
with the linearization are 0 and —1. The tangent space to the center manifold is 
the X axis. Hence, the restriction of the vector field to the center manifold in the 
tangent spaee approximation is given by 

X = —a:®. 



for which the origin is unstable. 

The center manifold can be calculated, and it is given by the graph of the 
following function 

h{x) = x^ + 0(4). 

The vector field restricted to the center manifold is given by 

X = —x^ + 0(5), 

which indicates that the origin is stable. 



End of Example 18.1.2 



18.2 Center Manifolds Depending on Parameters 

Suppose (18.1.1) depends on a vector of parameters, say e G In this 
case we write (2.1.2) in the form 

X = Ax + f(x,y,e), 

y = By + g(x,y,e), (x,y,e) G R"" x R‘' x (18.2.1) 

where 

/(0,0,0) = 0, 7^/(0, 0,0)=0, 

5 ( 0 , 0 , 0 ) = 0 , Dg(0,0,0) = 0, 

and we have the same assumptions on A and B as in (18.1.1), with / and g 
also being C’’ (r > 2) functions in some neighborhood of (x, y, e) = (0, 0, 0). 
An obvious question is why do we not allow the matrices A and B to depend 
on el This will be answered shortly. 

The way in which we will handle parametrized systems is to include the 
parameter e as a new dependent variable as follows 

x = Ax + f(x,y,e), 

e = 0, 

y = By + g(x,y,e), (x, £, y) G x x (18.2.2) 
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At first glance it might appear that nothing is really gained from this action, 
but we will argue otherwise. 

Let us suppose we are considering (18.2.2) afresh. It obviously has a fixed 
point at (x,s,y) = (0,0,0). The matrix associated with the linearization 
of (18.2.2) about this fixed point has c + p eigenvalues with zero real part 
and s eigenvalues with negative real part. Now let us apply center manifold 
theory. Modifying Definition 18.1.1, a center manifold will be represented 
as a graph over the x and s variables, i.e., the graph of h{x,e) for x and e 
sufficiently small. Theorem 18.1.2 still applies, with the vector field reduced 
to the center manifold given by 

u = Au + f{u,h{u,£),e), 

e = 0, (w,e) G X RP. (18.2.3) 

Theorems 18.1.3 and 18.1.4 also follow (we will worry about any modifi- 
cations to computing the center manifold shortly). Thus, adding the pa- 
rameter as a new dependent variable merely acts to augment the matrix A 
in (18.1.1) by adding p new center directions that have no dynamics, and 
the theory goes through just the same. However, there is a new concept 
which will be important when we study bifurcation theory, namely, the cen- 
ter manifold exists for all e: in a sufficiently small neighborhood of e: = 0. 
We will learn in Chapters 20 and 21 that it is possible for solutions to be 
created or destroyed by perturbing nonhyperbolic fixed points. Thus, since 
the invariant center manifold exists in a sufficiently small neighborhood in 
both X and e of {x, e) = (0, 0), all bifurcating solutions will be contained in 
the lower dimensional center manifold. 

Let us now worry about computing the center manifold. From the exis- 
tence theorem for center manifolds, locally we have 

lTi"o^(0) = { (x,£,j/) G R° X RP X R" I y = h{x,e), |a;| < 5, 

|e| < 5, h{Q, 0) = 0, Dh{Q, 0) = 0 } (18.2.4) 

for S and 6 sufficiently small. Using invariance of the graph of h{x, e) under 
the dynamics generated by (18.2.2) we have 

y = D^h{x, s)x + Dgh{x, e)s = Bh{x, s) + g (x, h{x, s),e) . (18.2.5) 

However, 

X = Ax + f (x, h{x, e), e) , 

e = 0; (18.2.6) 

hence substituting (18.2.6) into (18.2.5) results in the following quasilinear 
partial differential equation that h{x,e) must satisfy in order for its graph 
to be a center manifold. 

Af {h{x, e)) = D^h{x, e) [Ax + f {x, h{x, £),£)] 

— Bh{x, e) — g (x, h(x, e),e) = 0. (18.2.7) 
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Thus, we see that (18.2.7) is very similar to (18.1.10). 

Before considering a specific example we want to point out an important 
fact. By considering £ as a new dependent variable, terms such as 



X^Sj , 


1 < i < c, 


1 < J < P, 




1 < i < s, 


1<J<P, 



become nonlinear terms. In this case, returning to a question asked at the 
beginning of this section, the parts of the matrices A and B depending 
on e are now viewed as nonlinear terms and are included in the / and g 
terms of (18.2.2), respectively. We remark that in applying center manifold 
theory to a given system, it must first be transformed into the standard 
form (either (18.1.1) or (18.2.2)). 

Example 18.2.1 (The Lorenz Equations). Consider the Lorenz equations 
x = a{y- x), 

y = px + x — y — xz, {x,y,z)€R^, (18.2.8) 

i = ~l3z + xy, 



where a and /3 are viewed as hxed positive constants and p is a parameter (note: 
in the standard version of the Lorenz equations it is traditional to put p — p — 1). 
It should be clear that (x,y,z) = (0,0,0) is a hxed point of (18.2.9). Linearizing 
(18.2.9) about this hxed point, we obtain the associated matrix 



/ —a a 0 \ 

1 -1 0 . (18.2.9) 

V 0 0 -13 J 



(Note: recall, px is a nonlinear term.) 

Since (18.2.9) is in block form, the eigenvalues are particularly easy to compute 
and are given by 

0,-0- -1,-d, (18.2.10) 



with eigenvectors 




(18.2.11) 



Our goal is to determine the nature of the stability of {x, y, z) = (0, 0, 0) for p 
near zero. First, we must put (18.2.9) into the standard form (18.2.2). Using the 
eigenbasis (18.2.11), we obtain the transformation 



with inverse 





1 

1+0 



o 

-1 

0 



:) 

1 + o ) 




(18.2.12) 



(18.2.13) 
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which transforms (18.2.9) into 




/O 0 0 

= 0 -(1 + cr) 0 

\0 0 

^ ( ap{u + av) — aw(u + av ) ' 

—p(u + av) + w(u + av) 
(1 + a)(u + av)(u — v) ^ 

p = 0. 



+ 



1 + cr 



(18.2.14) 



Thus, from center manifold theory, the stability of (x, y, z) = (0, 0, 0) near p = 0 
can be determined by studying a one-parameter family of first-order ordinary 
differential equations on a center manifold, which can be represented as a graph 
over the u and p variables, i.e., 



W’^pS) = I {u,v,w,p) G I -y = hi{u,p),w = /i 2 (w, p), 
hi{0, 0) = 0, DhiiO, 0) = 0, i = 1, 2 j 
for u and p sufficiently small. 



(18.2.15) 



We now want to compute the center manifold and derive the vector field on 
the center manifold. Using Theorem 18.1.4, we assume 



hi{u,p) = aiu^ + a2up + asp^ + ■ 
h2 (m, P) = -I- b2Up + b3p'^ + ■ 



(18.2.16) 



Recall from (2.1.27) that the center manifold must satisfy 

e)^ = Dxh(x, e) [Ax + f(x, h{x, e), e)] 



where, in this example, 

X = u, 

A = 0, 



— Bh{x, e) — g(x, h(x, e),s) = 0, (18.2.17) 

y=(v,w), e = p, h = (hi,h 2 ), 



B = 
f{x,y,e) = 
g{x,V,e) = 



-(l + a) 0 
0 -p 

[ap(u + av) — aw(u + av)], 



(18.2.18) 



l + a 
1 

l + a 



—p(u -I- av) + w(u + av) 
( 1 -I- cr ) (m -I- av){u — v) 



Substituting (18.2.16) into (18.2.17) and using (18.2.19) gives the two components 
of the equation for the center manifold. 



(2aiM -I- 02P H ) 



1 -I- cr 



^p(m -I- ahi) — h 2 (u + ahi)^ 



-I- (1 -I- a)hi + — (m -I- ahi) — (u + ahi) = 0, 



1 -I- cr 



1 -I- cr 




18.2 Center Manifolds Depending on Parameters 255 



(26im + 62 P H ) 



+ '^^ 1 ) - h2{u + ahi)J 



+ Ph 2 — (m + ahi){u — hi ) = 0. 
Eqnating terms of like powers to zero gives 

: ai(l + u) = 0 ^ ai = 0, 

1 

P' 

up : {1 + a)u2 + 11-^ — = 0 ^ 02 = 



1 + CT 

Pb 2 = 0 62 = 0. 

Then, using (18.2.21) and (18.2.16), we obtain 

1 



(l + a)2 



hi{u,p) = - 



(1 + cr)2 



up + ■ 



h 2 {u,p) = + • 



(18.2.19) 



(18.2.20) 



(18.2.21) 



Finally, substituting (18.2.21) into (18.2.14) we obtain the vector field reduced 
to the center manifold 



1 + cr 

p = 0. 



M P- +• 



(18.2.22) 




In Figure 18.2.1 we plot the fixed points of (18.2.22) neglecting higher order terms 
such as O(p^), 0{up^), 0{u^), etc. It should be clear that m = 0 is always a fixed 
point and is stable for p < 0 and unstable for p > 0. At the point of exchange of 
stability (i.e., p = 0 ) two new stable fixed points are created and are given by 



P = 



1 

P 



(18.2.23) 
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A simple calculation shows that these fixed points are stable. In Chapter 20 we 
will see that this is an example of a pitchfork bifurcation. 

Before leaving this example two comments are in order. 

1. Figure 18.2.1 shows the advantage of introducing the parameter as a new 
dependent variable. In a full neighborhood in parameter space new soln- 
tions are “captured” on the center manifold. In Fignre 18.2.1, for each fixed 
p we have a flow in the u direction; this is represented by the vertical lines 
with arrows. 

2. We have not considered the effects of the higher order terms in (18.2.22) 
on Figure 18.2.1. In Chapter 20 we will show that they do not qualitatively 
change the figure (i.e., they do not create, destroy, or change the stability 
of any of the fixed points) near the origin. 

End of Example 18.2.1 



18.3 The Inclusion of Linearly Unstable Directions 

Suppose we consider the system 

X = Ax + f(x,y,z), 
y = By + g{x,y,z), 

z = Cz + h{x,y, z), (a:, y, z) G X K® X M“, (18.3.1) 

where 

/(0,0,0) = 0, 7^/(0,0,0)=0, 

5(0,0,0) = 0, 7^g(0,0,0) = 0, 

/t(0,0,0) = 0, L»/t(0,0,0) = 0, 

and /, g, and h are C^{r > 2) in some neighborhood of the origin, A is a 
c X c matrix having eigenvalues with zero real parts, B is an s x s matrix 
having eigenvalues with negative real parts, and C is a, uxu matrix having 
eigenvalues with positive real parts. 

In this case {x, y, z) = (0, 0, 0) is unstable due to the existence of a 
u-dimensional unstable manifold. However, much of the center manifold 
theory still applies, in particular Theorem 18.1.2 concerning existence, with 
the center manifold being locally represented by 

W^{0) = {(x,j/,z) G X R® X K“ I y = hi{x),z = /i 2 (x), 

/ii(0) = 0,Zlhi(0) = 0,f = 1,2} (18.3.2) 

for X sufficiently small. The vector field restricted to the center manifold is 
given by 



ii = Au + f[u, hi{u), h 2 {u)), 



M G K' 



(18.3.3) 
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Using the fact that the center manifold is invariant under the dynamics 
generated by (18.3.1), we obtain 

X = Ax + f[x, hi{x), h 2 {x)), 

y = Dh\{x)x = Bhi{x) + g{x, hi{x), h 2 {x)), 

z = Dh 2 {x)x = Ch 2 {x) + h{x, hi{x), h 2 {x)) , (18.3.4) 

which yields the following quasilinear partial differential equation for hi{x) 
and h 2 {x) 



Dhi{x) [Ax + f{x, hi{x),h 2 {x))] 

- Bhi{x) - g{x, hi{x),h 2 {x)) = 0, 

Dli 2 {x) [Ax + f[x, hi{x) , h 2 {x))] 

— Ch2{x) — h{x,hi{x),h2{x)) = 0. (18.3.5) 

Theorem 18.1.4 also holds in order that we may justify solving (18.3.5) 
approximately via power series expansions. We can also include parameters 
in exactly the same way as in Section 18.2. 

Of course, Theorem 18.1.3 does not hold as a result of the presence of 
the exponentially linearly unstable directions. Nevertheless, the formulation 
of the theory with the inclusion of the linearly unstable directions is still 
useful. It is often important to know the nature of solutions having saddle- 
type stability since their stable manifolds may play a role in forming the 
boundaries of the basins of attraction of attracting sets. In the context 
of bifurcation theory, the creation of unstable solutions may be important 
since it may be possible for them to undergo secondary bifurcations and, 
consequently, become stable. 



18.4 Center Manifolds for Maps 

The center manifold theory can be modified so that it applies to maps 
with only a slight difference in the method by which the center manifold is 
calculated. We outline the theory below. 

Suppose we have the map 

X I — Ax + f{x,y), 

y\ — >By + g{x,y), {x,y)€R‘'xR\ (18.4.1) 



or 



— xAXn f {Xn , ^n) j 

yn-\-l — Byn ^ 71)5 
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where 

/(0,0) = 0, Df{0,0) = 0, 

g(0,0) = 0, Dg{0,0) = 0, 

and / and g are C'’(r > 2) in some neighborhood of the orgin, A is a c x c 
matrix with eigenvalues of modulus one, and B is an s x s matrix with 
eigenvalues of modulus less than one. 

Evidently {x, y) = (0, 0) is a fixed point of (18.4.1), and the linear approx- 
imation is not sufficient for determining its stability. We have the following 
theorems, which are completely analogous to Theorems 18.1.2, 18.1.3, and 
18.1.4. 

Theorem 18.4.1 (Existence) There exists a C’’ center manifold for 
(18.4-1) which can he locally represented as a graph as follows 

W‘=(0) = {(a;, y) G X I j/ = h{x), |a;| < 6, h{0) = 0, Dh{0) = 0} 

(18.4.2) 

for S sufficiently small. Moreover, the dynamics of (18.4-1) restricted to 
the center manifold is, for u sufficiently small, given by the c-dimensional 
map 

ui — > Au + f(^u,h{u)), M G (18.4.3) 

Proof: See Carr [1981]. □ 

The next theorem allows us to conclude that (x,y) = (0,0) is stable or 
unstable based on whether or not u = 0 is stable or unstable in (18.4.3). 

Theorem 18.4.2 (Stability) i) Suppose the zero solution of (18. 4-3) is 
stable (asymptotically stable) (unstable). Then the zero solution of (18. 4-1) 
is stable (asymptotically stable) (unstable), ii) Suppose that the zero solution 
of (18.4-3) is stable- Let {xn,yn) he a solution of (18-4-1) with (xo,yo) 
sufficiently small. Then there is a solution Un of (18-4-3) such that |a;„ — 
Un\ < /c/3” and [?/„ — h{u-n)\ < kjS'^ for all n where k and (3 are positive 
constants with /3 < 1. 

Proof: See Carr [1981]. □ 

Next we want to compute the center manifold so that we can derive 
(18.4.3). This is done in exactly the same way as for vector fields, i.e., by 
deriving a nonlinear functional equation that the graph of h{x) must satisfy 
in order for it to be invariant under the dynamics generated by (18.4.1). In 
this case we have 



^n-t-l — AXji f (xji, h(^X-f)^ , 

yn +1 = h{Xn+l) = Bh{Xn) + g{Xn, h{Xn)) 



(18.4.4) 
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or 



A/”(/i(x)) = hi^Ax + f[x, h{x))) — Bh{x) — g{x, h{x)) = 0. (18.4.5) 

(Note: the reader should compare (18.4.5) with (18.1.10).) The next theo- 
rem justifies the approximate solution of (18.4.5) via power series expan- 
sions. 

Theorem 18.4.3 (Approximation) Let </> : — >■ K'* be a map with 

4>{0) = 0,^'(0) = 0, and Af(4>{x)) = 0{\x\^) as x — >■ 0 for some q > 1. 
Then 

h{x) = (j){x) + 0(|x|^) as X — >■ 0. 

Proof: See Carr [1981]. □ 



We now give an example. 



Example 18.4.1. Consider the map 

/ u\ /-I 0 0Wm\ / vw \ 

j V I !->■ j 0 —4 ^ 1 ( ^ 1 ”*” ( 1 ’ («,«, w) € R®. (18.4.6) 

\wj V 0 0 1/ \wj \-uvJ 

It should be clear that (u,v,w) = (0,0,0) is a fixed point of (18.4.6), and the 
eigenvalues associated with the map linearized abont this fixed point are —1, — 

4. Thus, the linear approximation does not suffice to determine the stability or 
instability. We will apply center manifold theory to this problem. 

The center manifold can locally be represented as follows 

IT'^(O) = |(u, u, w) € I u = hi{u), w = h 2 {u), hi{0) = 0, 

D/ii(0) = 0,i = l,2j (18.4.7) 

for u sufficiently small. Recall that the center manifold must satisfy the following 
equation 



M(^h{x)j = hyAx -I- f(^x, h{x)jj — Bh{x) — h(x)j = 0, 
where, in this example. 



x = u, y={v,w), 

A = -l, 




f{u, V, w) = vw, 



g{u,v,w) 




h = (hi, /i2). 



We assume a center manifold of the form 

hf 'I — ^i(^) ^ _ f \ 

^ \h 2 lu) J \a2U^ + b2U^ + 0{vi^) J ’ 



(18.4.8) 



(18.4.9) 



(18.4.10) 
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Snbstituting (18.4.10) into (18.4.8) and using (18.4.9) yields 
A/’^/i(m)^ 

( -\!2 0 \ /an 

0 1/2 y y 02«^ + b2u^ + ■ 



aiv? — biu^ + 0{u^) 
tt2U^ — b2U^ + 0(u^) 

— 1/2 0 \ / aiv? + biu^ + ■ ■ ■ 



u 

—uhi{u) 



Balancing powers of coefficients for each component gives 



ai + |ai — 1 
02 — 1 02 
—bi + ibi 



-&2 - I&2 + Ol 



“1 = 1 

02 = 0 ’ 

^ &1 = 0 

&2 = Ol I 



(18.4.11) 



(18.4.12) 



4 1 
9 



hence, the center manifold is given by the graph of /i 2 (w)j , where 



hi{u) = + 0{u^), 

h2(u) = + 0{u^). 



(18.4.13) 



The map on the center manifold is given by 



W 




FIGURE 18.4.1. 

Ml > -M + + 0{U% 

thns, the origin is attracting; see Fignre 18.4.1. 



(18.4.14) 



End of Example 18.4.1 
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Example 18.4.2. Consider the map 



{x,y) e : 



(18.4.15) 



The origin is a fixed point of the map. Computing the eigenvalues of the map 
linearized about the origin gives 

Ai,2 = l,i. 

Therefore, there is a one-dimensional center manifold and a one-dimensional sta- 
ble manifold with the orbit structure in a neighborhood of (0, 0) determined by 
the orbit structure on the center manifold. 

We wish to compute the center manifold, but first we must put the linear part 
in block diagonal form as given in (18.4.1). The matrix associated with the linear 
transformation has columns consisting of the eigenvectors of the linearized map 
and is easily calculated. It is given by 



-1 2 
1 -1 



Thus, letting 



our map becomes 



We seek a center manifold 



-2{u + vf 
(u + v)^ 



W%0) = {(u,v)\v = h{u)-, h{0) = Dh{0) = 0 } 
for u sufficiently small. The next step is to assume h{u) of the form 

h{u) = au^ -I- hu^ + 0{u^) 

and substitute (18.4.19) into the center manifold equation 



where, in this example, we have 



41 = 1 , 
„ 1 



(18.4.16) 



(18.4.17) 



(18.4.18) 



(18.4.19) 



= h^Au -I- f(u, — Bh{u) — g(u, h{u)^ = 0, (18.4.20) 



f{u,v) = -2{u + vy, 
g{u,v) = {u + vf, 



(18.4.21) 
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and (18.4.20) becomes 

a^u — 2(u + au^ + bu^ + 0{u*‘)j ^ 

+ b^u — 2(^u + au^ + bu^ + 0{u'^)j ^ 

+ ■ ■ ■ “ ^ (^au^ + bu^ + 0{u^)^ — ^« + au^ + bu^ + 0{u^'^ = 0 . 

(18.4.22) 

or 

avj^ + hi? — + 0{u^) = 0. (18.4.23) 




FIGURE 18.4.2. 

Equating coefficients of like powers to zero gives 
2 1 

u : a — -a = 0 ^ a = 0, 

: 6- ^6- 1 = 0 ^ = 2. (18.4.24) 

Thus, the center manifold is given by the graph of 

h{u) = 2u^ + 0{u'^), (18.4.25) 

and the map restricted to the center manifold is given by 

mi-^m-2(^m + 2m® + C)(m'‘)^^ (18.4.26) 

or 

u-2u^ + 0{u‘^). (18.4.27) 

Therefore, the orbit structure in the neighborhood of (0, 0) appears as in Fignre 
18.4.2 and (0,0) is stable. 



End of Example 18.4.2 
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Some remarks are now in order. 

Remark 1. Parametrized Families of Maps. Parameters can be included as 
new dependent variables for maps in exactly the same way as for vector 
fields in Section 18.2. 

Remark 2. Inclusion of Linearly Unstable Directions. The case where the 
origin has an unstable manifold can be treated in exactly the same way as 
for vector fields in Section 18.3. 



18.5 Properties of Center Manifolds 

In this brief section we would like to discuss a few properties of center 
manifolds. More information can be obtained from Carr [19811 or Siibrand 
[1985]. 



y 




Uniqueness 

Although center manifolds exist, they need not be unique. This can be seen 
from the following example due to Anosov (see Sijbrand [1985]). Consider 
the vector field 

X = , 

y = -y, (18.5.1) 

Clearly, (x, y) = (0, 0) is a fixed point with stable manifold given by x = 0. 
It should also be clear that y = 0 is an invariant center manifold, but there 
are other center manifolds. 

Eliminating t as the independent variable in (18.5.1), we obtain 

dy _ -y 
dx x^ 



(18.5.2) 
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The solution of (18.5.2) (for x ^ 0) is given by 

y{x) = (18.5.3) 

for any real constant a. Thus, the curves given by 

W^{0) = {(x, y) G I y = for x < 0, y = 0 for x > O} (18.5.4) 

are a one-parameter (parametrized by a) family of center manifolds of 
(x,y) = (0,0); see Figure 18.5.1. 

This example immediately brings up two questions. 

1. In approximating the center manifold via power series expansions 
according to Theorem 18.1.4, which center manifold is actually being 
approximated? 

2. Is the dynamical behavior the “same” on all of the center manifolds 
of a given fixed point? 

Regarding Question 1, it can be proven (see Wan [1977], Carr [1981] or 
Sijbrand [1985]) that any two center manifolds of a given fixed point differ 
by (at most) transcendentally small terms (cf. (18.5.4)). Thus, the Taylor 
series expansions of any two center manifolds agree to all orders. 

This fact emphasizes the importance of Question 2 from a practical point 
of view. However, it can be shown that due to the attractive nature of the 
center manifold, certain orbits that remain close to the origin for all time 
must be on every center manifold of a given fixed point, for example, fixed 
points, periodic orbits, homoclinic orbits, and heteroclinic orbits. 

Differentiability 

From Theorem 18.1.2 we have that if the vector field is C”, then the center 
manifold is also C”. However, if the vector field is analytic, then the center 
manifold need not be analytic; see Sijbrand [1985]. 

Preservation of Symmetry 

Suppose that the vector field (18.1.1) possesses certain symmetries (e.g., 
it is Hamiltonian). Does the vector field restricted to the center manifold 
possess the same symmetries? See Ruelle [1973] for a discussion of these 
issues. 

Preservation of Hamiltonian Structure 

For a Hamiltonian vector field, the restriction of the vector field to the 
center manifold gives rise to a Hamiltonian vector field. A proof of this can 
be found in Mielke [1991]. 
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18.6 Final Remarks on Center Manifolds 

A Coordinate Independent Center Manifold Reduction. We devel- 
oped center manifold theory by assuming that the linear part of the 
vector field was in block diagonal form, see, e.g., (18.1.1). However, 
been [1993] has developed a coordinate independent approach to the 
center manifold reduction. 

Center Manifolds for Stochastic Dynamical Systems. A center man- 
ifold theory for stochastic dynamical systems has been developed by 
Boxler [1989], [1991]. 

18.7 Exercises 



1. Consider the (r > 1) map 



X I— f{x), X C (18.7.1) 

Suppose that the map has a fixed point at x — xq, i.e., 

Xo = f(xo). 



Next consider the vector field 

X — f(x) — X. (18.7.2) 

Clearly (18.7.2) has a fixed point, and x — xq. What can you determine about the 
orbit structure near the fixed point of the map (18.7.1) based on knowledge of the 
orbit structure near the fixed point x — xq oi the vector field (18.7.2)? 



2. Consider the map 



and denote the Taylor expansion of / by 



f{x) = ao + aix + • • • + ar-ix'^ ^ + 0(|x|’^). 

Suppose / is identically zero. Then show that = 0, 1 = 0, . . . , r — 1. Does the same 
result hold for the map 



/:1R^ ^ n > 1? 



3. Study the dynamics near the origin for each of the following vector fields. Draw phase 
portraits. Compute the center manifolds and describe the dynamics on the center 
manifold. Discuss the stability or instability of the origin. 



a) S - -9 + v , G X Rb 

V — — sin 9, 



b) 



d) 



X — —X y X y, / X , 

2 (x,v)ei 

y = X + 2y + y , 

X — X — 2y, 



y — Sx — y — X , 

X — 2x 2y, 
y — X y x"^ , 



{x, y) G 1 



(x, y) G R^. 
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]^y y < (x, y) G 1 



h) 

i) 

j) 

k) 



X — -2x + 3y + y 
y — 2x — 3y , 

X — —x — y — xy^ 
y — 2x y 2xy, 

X — —x + y, 

y = • 



(x,y) G K 
(x,y) G 

(a;, v) e : 



- —2x + y + z + y z, 
■■ X — 2y z + xz ^ , 

■ X + y — 2z + x^y, 



{x,v,z) G ! 



— X — y + 2 , 

2x + y - z^, {x, y, z) 6 R^. 

X + 2y — z, 

-X - y - z - yz, 

— X — y — z — xz, {x, y,z)^. 

— X — y — z — xy, 



X — y X , 
y ^ -y - x^ 



y = -y - X 



(x, y) G 1 



(x, y) G 1 



I o I 2 I 4 5 
■ —X + 2y X y + X y , 
4 6. 89 

: y - X y X y , 



(x, y) £ ] 



4. Consider the following parametrized families of vector fields with parameter e G 

For £ = 0, the origin is a fixed point of each vector field. Study the dynamics near the 
origin for e small. Draw phase portraits. Compute the one-parameter family of center 
manifolds and describe the dynamics on the center manifolds. How do the dynamics 
depend on £? Note that, for e = 0, e.g., a) and a^) reduce to a) in the previous exercise. 
Discuss the role played by a parameter by comparing these cases. In, for example, a) 
and a^), the parameter e multiplies a linear and nonlinear term, respectively. Discuss 
the differences in these two cases in the most general setting possible. 



a) S-S + ev + v"^, (S, a) G x rC 

V — — sin 9, 

6 = —6 + + ev^ , 

V — — sin 9 



b) 

b') 



c') 



1 2 

-X + y + X y, 

X + 2y + ey + y'^ , 

1 2 

-X + y + X y, 

X + 2y + y'^ + ey^ , 



{x, y) G R^. 



X — 2y ex, 
3x — y — x^ , 



(x, y) G R^. 



X — 2y ex '^ , 
3a: — y — 



a: = 2a: -|- 2y -|- ey, 
y — X -\- y -\- x"^ , 



{x,y) G R^. 



X — 2x -\- 2y, 
y = a:-|-y-|-a:^+ ey^, 
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P '1 X = -y - ex - y 
’ V = 2x, 



(x, y) e : 



e') == = -y-y , 
y — 2x sx^ , 



^ — — 2ic -\-3y-\-sx-\-y 
y — 2x — 3y x^ , 



i^,y) ^ - 



X — —2x 3y y ex 
y — 2x — 3y x^ , 

. X — —X — y + ex — xy, 
y = 2x + y + 2xy, 



(x, y) G : 



g'-j X = —X — y — xy + ex , 
y = 2x + y + 2xy, 

X = -X + y, 

• X I —X I r, I 

y — —e + e +2x + ey, 



(x, y) G : 



X — —X + y + ex , 

• X I X I 

y — — e + e + 2x, 



X — —2x + y + 2 + ex — y z, 
y — X — 2y + z + ex + xz^, 
z — x + y — 2z + ex + x'^y, 



(x, y, z) G 1 



X — —2x y -\- z ex -\- y z, 
i^) y — X — 2y z -\- exy + xz'^ , 
z — x-\-y — 2z-\- x^y. 

X ^ -X - y + z'^ , 

j) y = 2 £c + y + ey - z^, (x,y,z)^^. 

z — X 2y — z^ 



X — —X — y ex + 2 , 

j') y = 2a; + y - z^ + ey^, 

2 = X + 2y — 2. 

X — —X — y — z ex — yz, 
k) y — —X — y — z — xz, 
z — —X — y — z — yz, 



(x, y, z) e 1 



X — —X — y — z — yz ex^ , 
k ) y — —X — y — z — xz, 
z — —X — y — z — xy. 



X ^ y + x + ey, 
y ^ -y - X , 



(x, y) e ! 



J') X = y + X + ey , 

y = -y - x^, 



X — x'^ ey, 

y ^ -y - 



(x, y) e 1 



/N, X — X + ey , 
y ^ -y - X . 

5. Study the dynamics near the origin for each of the following maps. Draw phase por- 
traits. Compute the center manifold and describe the dynamics on the center manifold. 
Discuss the stability or instability of the origin. 



X 1-^ X — y — xy 

ai 2 ^ ^ 

^ 1 2 
y ^ — — X X , 



(x, y) G R . 
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b) 



d) 



h) 



X + 2y + x^, 
2x + y, 

—x + y — xy^ 
V + 

2x + y, 

2x + Zy + x^ ^ 

x + 2y + y^, 
2x + 3y, 

I 2 , 2 

X + X + xy , 
3 

X — z , 

2x - y, 
,1.3 

X -\ Z X , 

2 



{x,y) e K^. 
(x, y) e R^. 
{x, y) e R^. 
{x,y) e R^. 
{x,y) e R^ 

(x, y, z) G R^ 



X + ^ 

-x-2y-x^, (x,y,z) 



I 2 
V + X , 

y + xy, 



(x, y) e R^. 



j) 



X 

y 



2 



X , 



y + xy, 



(x,y) e R^ 



6. Consider the following parametrized families of maps with parameter £ G . For e = 0, 
the origin is a fixed point of each vector field. Study the dynamics near the origin for 
€ small. Draw phase portraits. Compute the one-parameter family of center manifolds 
and describe the dynamics on the center manifolds. How do the dynamics depend on e? 
Note that, for e = 0, e.g., a) and a^) reduce to a) in the previous exercise. Discuss the 
role played by a parameter by comparing these cases. In, e.g., a) and a^), the parameter 
€ multiplies a linear and nonlinear term, respectively. Discuss the differences in these 
two cases in the most general possible setting. 



X I— >• ~2^ ~ y ~ xy 



y ^ --X ey + x , 



(x, y) G i 



b) 
b') 

c) 

c') 

d) 



X 1-^ ~2^ ~ y ~ ^y 



y ^ --y ey x . 

X X 2y x^ , 

y 2x + y + sy, 

X X 2y x^ , 

y 2x y ey^ . 

I 2 

X I— >• —X + y — xy , 
y i-J- y + ey + x^y, 

X 1-^ —X + y — xy^ , 
y y + + x'^y. 



{x,y) G 1 



(x, y) G 1 



X \-¥ 2x y,, 

y 2x ^y ex X 



,4 (x, y) G ! 



d') 



X 

y 



2x y ex ^ , 

2a: -1- 3y -b 
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X X ey, 
y ^ X + 2 y + y^ , 



{x,v) e 1 



e') 

f) 

f') 



X \-¥ X ey , 
y ^ x + 2 y + y'^ . 

X \-¥ 2 x 3 y, 
y X sy X + xy , 
X I— >• 2 a: + 3 y, 

X I 2 I 2 I 2 

yi-^a: + a: +ey + a:y 



(a:, y) S i 



a: 1-^ rc — 2 , 
y 2 x — y -\- ey, 



21— >-a:+— 2 + a: , 



(x, y, z) e 1 



g') 

h) 

h') 

i) 
i') 

j) 

j') 



a: 1-^ a: — 2 , 
y 2x — y -\- ey^ , 

I 1 I 3 

21— >-a:+— 2 + a: . 

a: 1-^ a: + sz ^ , 
y I— >• —X — 2y — x^ . 



(x, y, z) G 1 



z ^ y - -z + y , 

a: 1-^ a: + ea: + 2*^, 

o 3 

—X — 2 y — X 



y '■ 



z ^ y - -z + y . 

a: I— >• y + ea: + a:^ , 
y ^ y + xy, 

a: I— >• y + a:^ , 
y 1-^ y + a:y + ex"^ . 

X I 2 

a: 1-^ ea: + a: , 
y ^ y + xy, 

a: 1-^ a:^ + ey, 
y ^ y -\-xy. 



i^,y) ^ '■ 



{x, y) G J 



7. In Chapter 3 we illustrated the graph transform method and the Liapunov-Perron 
method for proving the existence of stable and unstable manifolds associated with a 
hyperbolic fixed point by applying the techniques to the specific vector field: 



y ^ -y + a: , (a:,y) G 

Now let’s consider a slight modification of this example: 

2 

x — x ^ 

V = -v + x^, (x,y) G 



For this example the origin is a nonhyperbolic fixed point with a one dimensional center 
manifold and a one dimensional stable manifold. Use the graph transform, Liapunov- 
Perron, and Taylor series methods to prove the existence of a center manifold of the 
origin. 

Recall from Example 7.2.1 of Chapter 7 that the equation a: = is an example of a 
vector field whose solutions “blow up in finite time”. Does this fact have any bearing 
on the existence of a center manifold? (Note: this provides another setting where the 
reader should think of the difference between “trajectories” and “manifolds made up 
of trajectories”. One is a dynamical object, the other is a geometrical object.) 
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Normal Forms 



The method of normal forms provides a way of finding a coordinate system 
in which the dynamical system takes the “simplest” form, where the term 
“simplest” will be defined as we go along. As we develop the method, three 
important characteristics should become apparent. 

1. The method is local in the sense that the coordinate transformations 
are generated in a neighborhood of a known solution. For our pur- 
poses, the known solution will be a fixed point. However, when we 
develop the theory for maps, the results will have immediate applica- 
tions to periodic orbits of vector fields by considering the associated 
Poincare map (cf. Chapter 10). 

2. In general, the coordinate transformations will be nonlinear functions 
of the dependent variables. However, the important point is that these 
coordinate transformations are found by solving a sequence of linear 
problems. 

3. The structure of the normal form is determined entirely by the nature 
of the linear part of the vector field. 

We now begin the development of the method. 



19.1 Normal Forms for Vector Fields 

Consider the vector field 

w = G{w), wGK”, (19.1.1) 

where G is C”, with r to be specified as we go along (note: in practice we 
will need r > 4). Suppose (19.1.1) has a fixed point at w = wq. 



19.1a Preliminary Preparation of the Equations 

We first want to perform a few simple (linear) coordinate transformations 
that will put (19.1.1) into a form which is easier to work with. 
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1. First we transform the fixed point to the origin by the translation 

V = w — Wq, V G M", 

under which (19.1.1) becomes 

V = G{v + Wq) = H{v). (19.1.2) 

2. We next “split off” the linear part of the vector field and write (19.1.2) 
as follows 

V = DH{0)v + H{v), (19.1.3) 

where H{v) = H{v) — DH{0)v. It should be clear that H{v) = 

3. Finally, let T be the matrix that transforms the matrix DH{0) into 
(real) Jordan canonical form. Then, under the transformation 

v = Tx, (19.1.4) 

(19.1.3) becomes 

x = T-^DH{0)Tx + T~^H{Tx). (19.1.5) 

Denoting the (real) Jordan canonical form of DH{0) by J, we have 

J = T-^DH{0)T, (19.1.6) 

and we define 

F{x) = T~^H{Tx) 

so that (19.1.4) is alternately written as 

x = Jx + F{x), a;GK”. (19.1.7) 

We remark that the transformation (19.1.4) has simplified the lin- 
ear part of (19.1.3) as much as possible. We now begin the task of 
simplifying the nonlinear part, F(x). 

First, we Taylor expand F{x) so that (19.1.7) becomes 

X = J X F 2 {x) J- F^(^x) J- • • • J- F , — i(x) J- CJ(|a;|”), (19.1.8) 



where Fi{x) represent the order i terms in the Taylor expansion of F(x). 
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19.1b Simplification of the Second Order Terms 

We next introduce the coordinate transformation 

x = y + h2{y), (19.1.9) 

where /i 2 (y) is second order in y. Substituting (19.1.9) into (19.1.8) gives 
X = (id+Dh 2 (y))y = Jy+ Jh 2 (y) + 0(y + /i 2 (y)) 

+Fs(y + h2(y)) H h Fr-i(y + h2(y)) + e>(|yr),(19.1.10) 

where “id” denotes the n x n identity matrix. Note that each term 

Fk{y + h 2 (y)), 2 < k < r - 1, (19.1.11) 

can be written as 

Fk{y) + 0(|y|'=+i) + • • • + 0(|yP"), (19.1.12) 

so that (19.1.10) becomes 

{id+Dh 2 {y))y = Jy+ Jh 2 {y) + F 2 {y) + Fb(y) 

+ ... + Fr-i{y) + 0{\yn, (19.1.13) 

where the change of notation for the 0{\y\^) terms, to ffc(y), serves to 
denote that they have been modified as a result of the coordinate transfor- 
mation. 

Now, for y sufficiently small, 

{id+Dh2{y))~" (19.1.14) 

exists and can be represented in a series expansion as follows (see Exercise 

3 ) 

(id+Dh 2 {y)) " =id-Dh2{y)+0{\y\^). (19.1.15) 

Substituting (19.1.15) into (19.1.13) gives 

y=Jy + Jh 2 {y) - Dh 2 {y)Jy + F 2 {y) + F^iy) 

+ --- + Fr-i{y) + 0{\yn. (19.1.16) 

Up to this point h 2 {y) has been completely arbitrary. However, now we 
will choose a specific form for h 2 {y) so as to simplify the 0{\y\^) terms as 
much as possible. Ideally, this would mean choosing h 2 {y) such that 

Dh2{y)Jy - Jh2{y) = F2{y), (19.1.17) 

which would eliminate ^ 2 ( 2 /) from (19.1.16). Equation (19.1.17) can be 
viewed as an equation for the unknown /i 2 (y)- We want to motivate the 
fact that, when viewed in the correct way, it is in fact a linear equation 
acting on a linear vector space. This will be accomplished by 1) defining 
the appropriate linear vector space; 2) defining the linear operator on the 
vector space; and 3) describing the equation to be solved in this linear 
vector space (which will turn out to be (19.1.17)). We begin with Step 1. 
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Step 1. The Space of Vector-Valued Homogeneous Polynomials of Degree 
Hk 

Let {si, • • • , s„} denote a basis of K”, and let j/ = (j/i, • • • , j/„) be coordi- 
nates with respect to this basis. Now consider those basis elements with 
coefficients consisting of homogeneous polynomials of degree k, i.e., 

n 

{yTyr---yn'^)s^, = (i^.i.is) 

i=i 



where mj > 0 are integers. We refer to these objects as vector-valued homo- 
geneous polynomials of degree k. The set of all vector-valued homogeneous 
polynomials of degree k forms a linear vector space, which we denote by 
Hk ■ An obvious basis for Hk consists of elements formed by considering all 
possible homogeneous polynomials of degree k that multiply each Si. The 
reader should verify these statements. Let us consider a specific example. 



Example 19.1.1. We consider the standard basis 




(19.1.19) 



on and denote the coordinates with respect to this basis by x and y, respec- 
tively. Then we have 



H 2 = Span 




(19.1.20) 



End of Example 19.1.1 



Step 2. The Linear Map on Hk 

Now let us reconsider equation (19.1.17). It should be clear that /i 2 (y) can 
be viewed as an element of H 2 - The reader should easily be able to verify 
that the map 

h2{y)' — > Dh2{y)Jy - Jh2{y) (19.1.21) 

is a linear map of H 2 into H 2 - Indeed, for any element hk(y) G 77fc, it 
similarly follows that 



hk{y)' — > Dhk{y)Jy - Jhk{y) (19.1.22) 

is a linear map of Hk into Hk- 

Let us mention some terminology associated with Eq. (19.1.17) that has 
become traditional. Due to its presence in Lie algebra theory (see, e.g., 
Olver [1986]) this map is often denoted as 

Lf\hk(y)) = -{Dhk{y)Jy - Jhk{y)) 



(19.1.23) 




274 



19. Normal Forms 



or 

-{Dhk{y)Jy - Jhk{y)) = [hk{y),Jy\, (19.1.24) 

where [•, •] denotes the Lie bracket operation on the vector fields hk{y) and 
Jy. 

Step 3. The Solution of (19.1.17) 

We now return to the problem of solving (19.1.17). It should be clear that 
7^2 (y) can be viewed as an element of H 2 . From elementary linear algebra, 
we know that H 2 can be (nonuniquely) represented as follows 

H2 = Lf{H2)®G2, (19.1.25) 

where G 2 represents a space complementary to L^p{H 2 ). Solving (19.1.17) 
is like solving the equation Ax = b from linear algebra. If ^ 2 ( 2 /) is in the 
range of Lj (•), then all 0{\y\ ) terms can be eliminated from (19.1.17). 
In any case, we can choose h 2 {y) so that only 0{\y^) terms that are in G 2 
remain. We denote these terms by 

F^{y) G G 2 (19.1.26) 

(note: the superscript r in (19.1.26) denotes the term “resonance,” which 
will be explained shortly). 

Thus, (19.1.16) can be simplified to 

y=Jy + F^iy) + F^iy) + • • • + F',_i(y) + 0(|yD. (19.1.27) 

At this point the meaning of the phrase “simplify the second-order terms” 
should be clear. It means the introduction of a coordinate change such that, 
in the new coordinate system, the only second-order terms are in a space 
complementary to L) '{H 2 ). If Ly {H 2 ) = H 2 , then all second-order terms 
can be eliminated. 

19.1c Simplification of the Third Order Terms 

Next let us simplify the 0(|j/|^) terms. Introducing the coordinate change 

— >y + h3{y), (19.1.28) 

where h 3 {y) = (!I(|y|^) (note: we will retain the same variables y in our 
equation), and performing the same algebraic manipulations as in dealing 
with the second-order terms, (19.1.27) becomes 

y = Jy + F2{y) + Jhsiy) - Dh3{y)Jy + Fs^y) + F i{y) 

+ ■ ■ ■ + Fj—i{y) + 0{\y\ ), (19.1.29) 
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where the terms Fk{y), 4 < k < r — 1, indicate, as before, that the coor- 
dinate transformation has modified the terms of order higher than three. 
Now, simplifying the third-order terms involves solving 

Dh^{y)Jy-Jh^{y) = h{y)- (19.1.30) 

The same comments as for second-order terms apply here. The map 

hz{y) I — Dh3{y)Jy - Jhsiy) = -L^\h3{y)) (19.1.31) 

is a linear map of H 3 into H 3 . Thus, we can write 

H3 = Lf{H3)(BG3, (19.1.32) 

where G 3 is some space complementary to Lj {H 3 ). Thus, the third-order 
terms can be simplified to 

F^{y) G G3. (19.1.33) 

|'Q\ 

If Lj {H 3 ) = H 3 , then the third-order terms can be eliminated. 

19. Id The Normal Form Theorem 

Clearly, this procedure can be iterated so that we obtain the following 
normal form theorem. 

Theorem 19.1.1 (Normal Form Theorem) By a sequence of analytic 
coordinate changes (19.1.8) can he transformed into 

y=Jy + F^{y) + ■ ■ ■ + Ff_,{y) + 0{\yn, (19.1.34) 

where Ff{y) G Gk, 2 < k < r — 1, and Gk is a space complementary to 
L^j\Hk). Equation (19.1.34) is said to be in normal form through order 
r — 1. 



Several comments are now in order. 

1. The terms Ff{y), 2 < k < r — 1, are referred to as resonance terms 
(hence the superscript r) . We will explain what this means in Section 
19.12. 

2. The structure of the nonlinear terms in (19.1.34) is determined en- 
tirely by the linear part of the vector field (i.e., J). 

3. It should be clear that simplifying the terms at order k does not 
modify any lower order terms. However, terms of order higher than 
k are modified. This happens at each step of the application of the 
method. If one wanted to actually calculate the coefficients on each 
term of the normal form in terms of the original vector field, it would 
be necessary to keep track of how the higher order terms are modified 
by the successive coordinate transformations. 
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Example 19.1.2 (The Takens-Bogdanov Normal Form). We want to compute 



the normal form for a vector held on 
where the linear part is given by 



Second-Order Terms 



We have 



H 2 = span 



in the neighborhood of a hxed point 



(19.1.35) 



(19.1.36) 



We want to compute Lj \H 2 )- We do this by computing the action of Lj (•) on 
each basis element on H 2 



. ( 2 ) / xy 



From (19.1.37) we have 

L^P{H 2 ) = span 



Clearly, from this set, the vectors 



(19.1.37) 



(19.1.38) 



(19.1.39) 
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are linearly independent and, hence, second-order terms that are linear combi- 
nations of these four vectors can be eliminated. To determine the nature of the 
second-order terms that cannot be eliminated (i.e., F 2 {y)), we must compute a 
space complementary to Lj (H 2 ). This space, denoted G 2 , will be two dimen- 
sional. 

In computing G 2 it will be useful to first obtain a matrix representation for the 
linear operator {■). This is done with respect to the basis given in (19.1.36) by 
constructing the columns of the matrix from the coefficients multiplying each ba- 
sis element that are obtained when lP {■) acts individually on each basis element 
of H 2 given in (19.1.36). Using (19.1.37), the matrix representation of lP [■) is 
given by 



/ 0 001 0 0 \ 

-2 0 0 0 1 0 

0 -1 0 0 0 1 

0 0 0 0 0 0 

0 0 0 -2 0 0 

Vo 0 0 0 -1 0/ 



(19.1.40) 



One way of finding a complementary space G 2 would be to find two “6-vectors” 
that are linearly independent and orthogonal (using the standard inner product 
in R®) to each column of the matrix (19.1.40), or, in other words, two linearly 
independent left eigenvectors of zero for (19.1.40). Due to the fact that most 
entries of (19.1.40) are zero, this is an easy calculation, and two such vectors are 
found to be 



Hence, the vectors 



0 




0 


0 




0 


0 




1 


1 




0 


Vo^ 




Vo/ 




r 0 


,2^y) ’ 





(19.1.41) 



(19.1.42) 



span a two-dimensional subspace of H 2 that is complementary to lP{H 2 ). This 
implies that the normal form through second-order is given by 



X = y + aix^ + 0{3), 

y = U 2 xy + UiX^ + 0{3), (19.1.43) 



where 01,02, and 03 represent constants. 

Now our choice of G 2 is certainly not unique. Another choice might be 



G 2 = span 



(19.1.44) 



This complementary space can be obtained by taking the vector 

x^ 

\xy 



(19.1.45) 
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given in (19.1.42) and adding to it the vector 




contained in L^p{H 2 )- This gives the vector 




(19.1.46) 



(19.1.47) 



(and the multiplicative constant | is irrelevant). For the other basis element of 
the complementary space, we simply retain the vector 

(19.1.48) 




given in (19.1.42). With this choice of G 2 the normal form becomes 

X = y + + 0(3), 

y = a2X^ + 0{Z). (19.1.49) 



This normal form near a fixed point of a planar vector held with linear part given 
by (19.1.35) was hrst studied by Takens [1974]. 

Another possibility for G 2 is given by 

G2=span{(^°),(j^J}, (19.1.50) 

where the second vector is obtained obtained by subtracting the third vector in 
(19.1.39) from the hrst vector in (19.1.42). With this choice of G 2 the normal 
form becomes 



X = y + 0{3), 

y = aix^ + b 2 xy + 0{3)-, (19.1.51) 

this is the normal form for a vector held on near a hxed point with linear part 
given by (19.1.35) that was hrst studied by Bogdanov [1975]. 

End of Example 19.1.2 



19.2 Normal Forms for Vector Fields with 
Parameters 

Now we want to extend the normal form techniques to systems with pa- 
rameters. Consider the vector field 






X G K", IIGICW 



(19.2.1) 




19.2 Normal Forms for Vector Fields with Parameters 



279 



where I is some open set in and / is C’’ in each variable. Suppose that 

/(0,0) = 0 (19.2.2) 

(note: the reader should recall from the beginning of Section 19.1 that 
there is no loss of generality in assuming that the fixed point is located 
at (x,/i) = (0,0)). The goal is to transform (19.2.1) into normal form 
near the fixed point in both phase space and parameter space. The most 
straightforward way to put (19.2.1) into normal form would be to follow 
the same procedure as for systems with no parameters except to allow the 
coefficients of the transformation to depend on the parameters. Rather than 
develop the general theory along these lines, we illustrate the idea with a 
specific example which will be of much use later on. 



19.2a Normal Form for The 

Poincare- Andronov-Hopf Bifurcation 

Suppose a; e and 72/(0, 0) has two pure imaginary eigenvalues A(0) = 
±zw(0). Then we can find a linear transformation which puts Dxf{0,fx) in 
the following form 

=(lm AM RrAM*) 

for /i sufficiently small. Also, by the implicit function theorem, the fixed 
point varies in a C’’ manner with fj. (for fj, sufficiently small) such that, 
if necessary, we can introduce a parameter-dependent coordinate transfor- 
mation so that x = 0 is a fixed point for all fi sufficiently small. We will 
assume that this has been done. 

Letting 

Re A(/r) = |A(/r)| cos(27r0(/x)), 

Im A(/r) = |A(/x)| sin(27T0(^)), (19.2.4) 

it is easy to see that (19.2.3) can be put in the form 

Now we want to put the following equation into normal form 

/ cos27r6*(/r) — sin27r0(^)\ f x\ 

sin 27r0(/r) cos 27 t 6 *(^) ) \y ) 




where the /* are nonlinear in x and y. 
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We remark that we will frequently omit the explicit parameter depen- 
dence of A, 9, and possibly other quantities from time to time for the sake 
of a less cumbersome notation. 

In dealing with linear parts of vector fields having complex eigenvalues, 
it is often easier to calculate the normal form using complex coordinates. 
We will illustrate this procedure for this example. 

We make the following linear transformation 




to obtain 






\F\zrz-,9))^ 



(19.2.7) 



(19.2.8) 



where 



F^{z,z;n) = f'^{x{z,z),y{z,z);fi) +if{x{z,z),y{z,z);ii), 

F'^{z,z;y) = f'^{x{z,z),y{z,z);y) - if{x{z,z),y{z,z);fj,). 
Therefore, all we really need to study is 

z=\\\e^^^^z + F^{z,z-,y), (19.2.9) 

since the second component of (19.2.8) is simply the complex conjugate 
of the first component. We will therefore put (19.2.9) in normal form and 
then transform back to the x,y variables. 

Expanding (19.2.9) in a Taylor series gives 

i = |A| + F2 + F3 + --- + Fr-i + 0{\z\^ , Izf), (19.2.10) 

where the Fj are homogeneous polynomials in z, z of order j whose coeffi- 
cients depend on y. 

Simplify Second-Order Terms 

We make the transformation 

zi — > z -\- h 2 {z, z), (19.2.11) 

where h 2 {z,z) is second-order in z and z with coefficients depending on y. 
We neglect displaying the explicit p dependence. 

Under (19.2.11), (19.2.10) becomes 

i ^1 -I- = Xz -\- A/i 2 + F2{z, z) -\- 0(3) 



(19.2.12) 
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or 



i= 1 + 



dh2 

dz 



-1 r 



3h 

Xz + \h 2 - ^Z + F 2 + 0{?,) 
oz 



Note that we have 

Z = \z + p 2 + d(3) 
and, for z, z sufficiently small 

-1 



1 + 



dh2 

dz 



= l-|?+0(2). 



Thus, using (19.2.13) and (19.2.14), (19.2.12) becomes 
r)h - r)h 

Z = \z — X-;^Z — X-^Z + Xh2 + F 2 + 0(3), 
oz oz 

so that we can eliminate all second-order terms if 



Xh2 



dh2 Tdh2_ 

A— — z -I- A^;— z 

Oz Oz 



+ F 2 = 0. 



(19.2.13) 



(19.2.14) 



(19.2.15) 



(19.2.16) 



Equation (19.2.16) is very similar to Eq. (19.1.17) derived earlier. The map 



Xh2- X 



Oh2 

Oz 



z -t“ A 



T 0h2 



Oz 



(19.2.17) 



is a linear map of the space of homogeneous polynomials in z and z of 
degree 2 into itself. We denote this space by H 2 - F 2 can also be viewed as 
an element in this space. Thus, solving (19.2.16) is a problem from linear 
algebra. 

Now we have 

7/2 = span {z^, zz, z^} . (19.2.18) 

Computing the action of the linear map (19.2.17) on each of these basis 
elements gives 



Az2 - 



Azz — 



Az2- 




z X 
z -t“ A 
z 4“ A 




z 



z 



z 



— Az^, 



— Azz, 



(A-2A)z^. 



Thus, (19.2.17) is diagonal in this basis with a matrix representation 
given by 

/-A(m) 0 0 \ 

0 -x{fi) 0 _ . (19.2.19) 

V 0 0 X{fj,)-2X{fi) J 

For ^ = 0, it should be clear that A(0) yf 0 and A(0) = — A(0); hence, 
for fjL sufficiently small, X{p) yf 0 and A(/r) — 2A(/r) yf 0. Therefore, for /x 
sufficiently small, all second-order terms can be eliminated from (19.2.10). 
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Simplify Third- Order Terms 
We have 

i = Az + Fa + 0(4). (19.2.20) 

Let z I — > z + hs^z, z); then we obtain 

z = ^1 H — — — ^ Az — ^ 3 (- 2 , z) + 0(4) 

= Az - A^z - A^z + Xhs + Fs + 0(4). 
oz az 

We want to solve 

A/i3-A^z-A^z + +3 = 0. (19.2.21) 

oz oz 

Note that we have 

H 3 = span {z^, z^z, zz^, z^} . (19.2.22) 

We compute the action of the linear map 

r)h - r)h 

hs^Xhs- X^z + X^z (19.2.23) 

y Oz az \ 

on each basis element of and obtain 





Therefore, a matrix representation for (19.2.23) is given by 

/-2A(/i) 0 _ 0 0 

0 -{X{pi) + X{pi)) 0 0 

0 0 -2A(/r) 0 

Vo 0 0 A(/r) - 3A(/r) 

Now, at /r = 0, 

A(0) + A(0) = 0; (19.2.26) 

however, none of the remaining columns in (19.2.25) are identically zero at 
fj, = 0. Therefore, for p, sufficiently small, third-order terms that are not of 
the form 




(19.2.27) 
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can be eliminated. 

Thus, the normal form through third-order is given by 

z = \z + c{^j)z^z + 0{A), (19.2.28) 

where is a constant depending on /x. 

Next we simplify the fourth-order terms. However, notice that, at each 
order, simplification depends on whether 

\h - (^Xz^ + \z^^ = 0 (19.2.29) 

for some h = where m -I- n is the order of the term that we want to 

simplify. Substituting this into (19.2.29) gives 

- {nXz^z^ + mXz^z^) = 0, 

(A - nA - mA) = 0. (19.2.30) 

At /i = 0, A = —A; hence we must not have 

l + m-n = 0. (19.2.31) 

It is easily seen that this can never happen if m and n are even numbers. 
Therefore, all even-order terms can be removed, and the normal form is 
given by 

z = Xz + c{n)z‘^z + 0{5) (19.2.32) 

for /i in some neighborhood of /i = 0. 

We can write this in cartesian coordinates as follows. Let A(/x) = a(/x) -I- 
tw(/x), and c(/x) = a(/x) -I- ib{^). Then 

x = ax — ujy + {ax — by){x^ + y^) + 0(5), 
y = ujx + ay + (bx + ay){x^ + y^) + 0(5). (19.2.33) 

In polar coordinates, it can be expressed as 

r = ar + ar^ + • • • , 

e = (jj + br^ + ■■■ . (19.2.34) 

We will study the dynamics associated with this normal form in great detail 
in Chapter 20 when we study the Poincare- Andronov-Hopf bifurcation. 

Differentiability 

We make the important remark that in order to obtain the normal form 
(19.2.32) the vector field must be at least C^. 
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19.3 Normal Forms for Maps 

We now want to develop the method of normal forms for maps. We will 
see that it is very much the same as for vector fields with only a slight 
modification. 

Suppose we have a O'" map which has a fixed point at the origin and is 
written as follows 

X I — >■ Jx + F2(x) H + Fr-i(x) + 0{\x\^) 

or 

Xn+l = JXn + F2{Xn) + ■ ■ ■ + Fr-l{Xn) + 0{\Xn\ ), (19.3.1) 

where x G K", and the Fj are vector- valued homogeneous polynomials of 
degree j. We introduce the change of coordinates 

x = y + h2{y), (19.3.2) 

where h 2 {y) is a vector valued homogeneous polynomial of degree 2. After 
this transformation (19.3.1) becomes 

Xn+l = yn+1 + h2{Vn+l) = Jy-n + Jh2{yn) + ^2(j/n) + 0{3) 
or 

(id +h2){yn+l) = Jyn + Jh2{yn) + F2{yn) + C^(3). (19.3.3) 

Now, for y sufficiently small, the function (id -|-/i 2 )(’) is invertible so that 
(19.3.3) can be written as 

J/n+l = (id-|-/l2) ^ {JVn + Jh 2 {yn) + F 2 {yn) + (19.3.4) 

For y sufficiently small, (id -|-/i 2 )~^(’) can be expressed as follows (see Ex- 
ercise 3) 

(id+/i2)-F‘) = (id-/i2 + 0(4))(-), (19.3.5) 

so that (19.3.4) becomes 

yn+i = Jy„ + Jh 2 {yn) - h 2 {Jyn) + F 2 (?/„) + 0(3). (19.3.6) 

Thus, we can eliminate the second-order terms if 

Jh2{y)-h2{Jy) + F2{y) = Q. (19.3.7) 

(Compare this with the situation for vector fields.) 

This process can be repeated, but it should be clear that the ability to 
eliminate terms of order j depends upon the operator 

hjijj) ' — ^ Jhj(y) - hj{Jy) = M^j\hj{y)), (19.3.8) 

which (the reader should verify) is a linear map of Flj into Hj , where Hj is 
the linear vector space of vector- valued homogeneous polynomials of degree 
j. The analysis proceeds as in the case for vector fields except the equation 
to solve is slightly different (it has a term involving a composition rather 
than a matrix multiplication). Let us consider an example which can be 
viewed as the discrete time analog of this example. 
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19.3a Normal Form for the Naimark-Sacker Torus 
Bifurcation 

Suppose we have a C’’ map of the plane 

XI — > f{x,fi), a; G (19.3.9) 



where I is some open set in Suppose also that (19.3.9) has a fixed 

point at a; = 0 for /a sufficiently small (cf. Example 19.2a) and that the 
eigenvalues of Df{0,fi), fj, small, are given by 

Ai = |A(/a)| Aa = |A(/x)| (19.3.10) 



i.e., Ai = A 2 . Furthermore, we assume that at /x = 0 the two eigenvalues 
lie on the unit circle, i.e., |A(0)| = 1. As in Example 19.2a, with a linear 
change of coordinates we can put the map in the form 



( 



X 

y 



) 



|A| 



f cos27rd 
sin 27 t6* 



— sin27T0\ / X 

cos 27T0 ) \V 



( P{x,y; y.)\ 
\P{x,y;fi) J ’ 



(19.3.11) 



where P{x, y; y) are nonlinear in x and y. 

Utilizing the same complex linear transformation as in Example 19.2a, 
we reduce the study of the two-dimensional map to the study of the one- 
dimensional complex map 

zi — > X{y)z + F^{z,z; y), (19.3.12) 

where f + if and X{y) = |A(m)| , 

We want to put this complex map into normal form. As a preliminary 
transformation, we expand F^{z, z; y) in a Taylor expansion in z and z 
with coefficients depending on y so that (19.3.12) becomes 



Zn+i — X{y)zn -l- E 2 Fr-1 + 0{r), (19.3.13) 



where Fj is a homogeneous polynomial of order j in z and z. 

Simplify Second-Order Terms 

Introducing the transformation 

zi — >z + h2{z,z), (19.3.14) 

where /i 2 (z, z) is a second-order polynomial in z and z with coefficients 
depending on y, (19.3.13) becomes 

Zn-i-l + h2{Zn+l,Zn+l) = Az„ + A/l2(2n, zf + E 2 (z„, Z„) + 0(3) 



or 



Zn+l = XZn + Xh2{z„,Z„) - ft-2 (-^n-l-l , 2„+l ) -f U2(z„,Z„) -f 0(3). (19.3.15) 
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Let us simplify further the term 

h2{Zn+l,Zn+l) (19.3.16) 

in the right-hand side of (19.3.15). Clearly we have 

Zn+l = XZn + 0{2), 

z„+i = Az„ + 0(2), (19.3.17) 

so that 

h2{Zn+l,Zn+l) = ft-2 (A^n , Az„) -|-C>(3). (19.3.18) 

Substituting (19.3.18) into (19.3.15) gives 

Zn+l = Az„ -|- Xh 2 (Zn, z„) — /i 2 (Az„, Az„) + F 2 + 0(3). (19.3.19) 

Therefore, we can eliminate all second-order terms provided we can find 
h 2 {z, z) so that 

Xh 2 {z, z) - h 2 {Xz, Xz) + F 2 = 0. (19.3.20) 

As in all other situations involving normal forms that we have encountered 
thus far, this involves a problem from elementary linear algebra. This is 
because the map 

/i 2 (z, z) I — > Xh 2 {z, z) — /i 2 (Az, Xz) (19.3.21) 

is a linear map of 7^2 into Fl 2 where 

iJ 2 = span . (19.3.22) 

In order to compute a matrix representation for (19.3.21) we need to com- 
pute the action of (19.3.21) on each basis element in (19.3.22). This is given 
as follows 



Xz^ — }?z^ = A(1 — A)z^, 

Xzz — XXzz = A(1 — X)zz, 

Xz^-X^z^ = (X-X^)z^. (19.3.23) 



Using (19.3.23), the matrix representation for (19.3.21) with respect to the 
basis (19.3.22) is 



'A(m)(1-A(m)) 0 0 

0 a(m)(i-A(m)) o_ 

0 0 A(/i) — A(/i)^ 



Now, by assumption, we have 



A(0) 



1 

m' 



(19.3.24) 



|A(0)| = 1 



and 



(19.3.25) 
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Therefore, (19.3.24) is invertible at = 0 provided 
A(0) ^ 1, 

"^(0) 7^ A(0)2 ^ (19.3.26) 

If (19.3.26) are satisfied at /i = 0, then they are also satisfied in a suffi- 
ciently small neighborhood of /i = 0. Therefore, if (19.3.26) are satisfied, 
then all second-order terms can be eliminated from the normal form for fx 
sufficiently small. 



Simplify Third- Order Terms 

Using an argument exactly like that given above, third-order terms can be 
eliminated provided 



Xhsiz, z) - hz{Xz, Xz) -1- Tb = 0. 


(19.3.27) 


The map 




h^{z, z) 1 — )> Xh^{z, z) - h^{Xz, A^) 


(19.3.28) 


is a linear map of into where 




= span {z^, z'^z, zz^, . 


(19.3.29) 



The action of (19.3.28) on each element of (19.3.29) is given by 



= A(l-A2)2^ 

Xz'^z — }?\z^z = A(1 — AA)z^z, 

\zz^ — X^Xzz^ = A(1 — )?)zz^, 

Xz^ -X^z^ = {X-X^)z^. (19.3.30) 



Thus, a matrix representation of (19.3.28) with respect to the basis 
(19.3.29) is given by 



f X{p,){l- X{p,f) 0 _ 0 

0 A(/x)(l - A(^)A(/i)) 0_ 

0 0 A(/x)(l - A(/x)2) 

0 0 






0 



V 

Recall that at /i = 0 we have 
|A(0)| = 1, 



ht) - 

(19.3.31) 



J 



A(0) = 



A(0)’ 



(19.3.32) 



so that at /i = 0 the second column of (19.3.31) is all zero’s. The reader 
can easily check that the remaining columns are all linearly independent 
at /r = 0 provided 



A^(0) ^ 1, A"(0) ^ 1. 



(19.3.33) 




288 



19. Normal Forms 



This situation will also hold for /i sufficiently small. Therefore, the normal 
form is as follows 

zi — > X(fj,)z + + 0{4), (19.3.34) 

where is a constant, provided 

A”(0) yf 1, n=l,2,3,4 



for fj, sufficiently small. 

More generally, simplification at order k depends on how the linear op- 
erator 

hk{z, z) I — > Xhk{z, z) — hk{Xz, Xz) (19.3.35) 

acts on elements like h = z^z^ where m -I- n is the order of the term one 
wishes to simplify. Substituting this into the above equation gives 

- A"A™z”z™ = A(1 - A"-^A’”)2”0™. (19.3.36) 

At /i = 0, A = 1/A; hence, we cannot have 

A”-™-i(0) = 1. (19.3.37) 

We leave it to the reader to work out general conditions for the elimination 
of higher order terms based on (19.3.37). 

Differentiability 

We make the important remark that in order to obtain the normal form 
(19.3.34) the map must be at least C"^. 



19.4 Exercises 

1. Prove that is a linear vector space. 

2. Suppose hk{x) G Hk {x G R”"), and J is an n X n matrix of real numbers. Then prove 
that the maps 

a) hk(x) !->■ Jhk(x) - Dhk{x)Jx = Dff (hk(x)), 

b) hk{x) !->■ Jhk{x) — hk(Jx) = M^ff{hk(x)), 
are linear maps of Hk into Hk- 

3. Argue that, for y G R”' sufficiently small, 

a) (id -\-Dhk{y))~^ exists 
and 

b) (id +Dhk{y))~'- = id -Dhk(y) 

for hk{y) G Hk- Similarly, show that for y G R”' sufficiently small 



:) (id-l-ftfc) ^(y) exists 
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and 

d) (id+7ifc)“i(y) = (id-/ifc H ){y). 

4. Compute a normal form for a map in the neighborhood of a fixed point having the 
linear part 




through second-order terms. 

Compare the resulting normal form with the normal form of a vector field near a fixed 
point having linear part 

(s ;) 

(see Example 19.1.2). Explain the results. 

5. Consider a third-order autonomous vector field near a fixed point having linear part 

/O -oj 0\ 

[ LU 0 0 

\ 0 0 0 / 

with respect to the standard basis in Show that in cylindrical coordinates a normal 
form is given by 

r = airz -\- a 2 r^ + 0(4), 

z — + h2Z^ b^r^z -h 642^ - 1 - 0 ( 4 ), 

9 — uj c^z 0 ( 2 ), 

where ai, U 2 , < 13 , fci, b 2 , ^ 3 , & 4 , and ci are constants. {Hint: lump the two coordinates 
associated with the block 




into a single complex coordinate.) 

6 . Consider a four-dimensional (r as large as necessary) vector field having a fixed 
point where the matrix associated with the linearization is given by 

( 0 —u)\ 0 \ 

ioi 0 0 0 I 

0 0 0 — OJ2 I 

0 0 CJ2 0 / 

Compute the normal form through third order. {Hint: use two complex variables.) 
You should find that certain “resonance” problems arise; namely, the normal form will 
depend on muJi ncu 2 7 ^ 0, \m\ |n| < 4. Give the normal form for the cases 



a) 


mcui - 1 - nco 2 


= 0 , 


Im-I 


+ 


|n| 


= 1 . 


b) 


muJi - 1 - nco 2 


^ 0 , 


Im-I 


+ 


|n| 


^ 2 . 


c) 


muJi n<jj 2 


^ 0 , 


|m| 


+ 


|n| 


- 3. 


d) 


mu)i n<jj 2 


^ 0 , 


|m| 


+ 


|n| 


^ 4. 


e) 


mu)i - 1 - nL 02 7^ 0 , 


|m| 


+ 


|n| 


< 4. 



7. Consider the normal form for a map of in the neighborhood of a fixed point where 
the eigenvalues of the matrix associated with the linearization, denoted Ai and A 2 , are 
complex conjugates, i.e., Ai = A 2 , and have modulus one, i.e., |Ai| = IA 2 I = |A| = 1 
(cf. Example 19.3a). Compute the normal form for the cases. 



a) A = 1. 

b) A^ ^ 1 . 

c) A^ ^ 1 . 
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d) A"‘ = 1. 

8. Compute the normal form of a map of in the neighborhood of a fixed point where 
the matrix associated with the linearization has the following form 




Compare your normal forms with those obtained in parts a) and b) of Exercise 7. 

9. Consider a (r > 2) vector field 

X — X G (1 G 

defined on a sufficiently large open set in X IR^. Suppose that = (0,0) is 

a fixed point of this vector field and that Dxf{0,0) has a pair of purely imaginary 
eigenvalues. 



a) Show that there exists a curve of fixed points of the vector field, denoted 
ai{0) = 0, for ^ sufficiently small. 

b) By using this curve of fixed points as a parameter-dependent coordinate trans- 
formation, show that one can choose coordinates so that the origin in phase 
space remains a fixed point for /i sufficiently small. 



19.5 The Elphick-Tirapegui-Brachet-Coullet-Iooss 
Normal Form 

In this section we will describe the work of Elphick et al. [1987]. A key 
feature of their work is that they introduce an inner product on H^. This 
inner product can be used to construct the orthogonal complement to Lj . 
The resulting normal form has a number of nice properties. Moreover, their 
method seems to be computationally very efficient. Similar work by Cush- 
man and Sanders [1986] appeared simultaneously, however all of the results 
described in this section are taken from Elphick et al. [1987]. 

For easier reference we recall (19.1.8), which we wish to transform into 
normal form: 

X = Jx 7 ^ 2 ( 2 ;) -|- F^{x) -|- • • • -l- F , — i(a^) -l- (H(|a;|’'), 

= Jx + F{x) + 0{\x\^), (19.5.1) 

where Fi{x) represent the order i terms in the Taylor expansion of F{x) 
and F{x) represents the nonlinear terms in this expansion through order 
r — 1. 
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19.5a An Inner Product on Hk 

First we define an inner product on the scalar valued homogeneous polyno- 
mials of degree k, and then use this to construct an inner product on Hk- 
Let X = (xi, . . . ,Xn) and let p{x), q{x) denote scalar valued homogeneous 
polynomials in x of degree k. For example, 

P{x)= ^ •••a;™", (19.5.2) 

_| = fc 

miez+u{0} 



where denotes the positive integers, and denotes a scalar. 

Following Bargmann [1961], Elphick et al. [1987] define the following inner 
product on the space of scalar valued homogeneous polynomials in x of 
degree k 

{p,q) =p{d)q{x)\x=o, (19.5.3) 

where p{d) is the symbol for a homogeneous polynomial of degree k in 
(9i, • • • , dn) with di = For example, for p{x) as defined in (19.5.2), we 
have 

^ Qmi Q'tTLn 

p{d)= (19.5.4) 

m.i + ---+m„ = k 1 ” 

mieZ+U{0} 

The following example computation is instructive. Let 

n n 

p(x) = •••X™", (7(x) = • • • X™" , = 

2=1 2=1 



Then 



{p, q) 



Qmi 



Qm„ \ 



(x 



mi 

1 



= n 



W 



n 

^nii ■ • 



There are two basic properties of this inner product that we now derive. 
Property One 

Let p{x), q{x), r{x) denote scalar valued homogeneous polynomials in x of 
degree k. Then it follows from (19.5.3) that 



(qr,p) = q{d)r{d)p{x)\x=o, 

= r{d)q{d)p{x%=o, 

= {r,q{d)p). (19.5.5) 



This equation states that multiplication by the polynomial q{x) is the ad- 
joint of differentiation by q{d). 
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Property Two 

Another key property is the following. 

Suppose A : R" — >■ R" is a linear, invertible operator. Then the second 
basic property is the following 

{p{Ax),q{x)) = {p{x),q{A*x)), (19.5.6) 

where A* denotes the adjoint of A. We will leave the proof of this property 
to the exercises. 

Now we construct an inner product on With respect to the chosen 
basis on R”, the corresponding coordinates of a point x G R" are denoted 
X = (xi, . . . ,Xn)- Then, with respect to this same basis, a vector valued 
homogeneous polynomial of degree k (i.e., an element of is denoted 
hk{x) h^ix)), where h1{x) is a scalar valued homogeneous 

polynomial of degree k for each i. Then an inner product on Hk is defined as 
follows. For hk, gk G Hk, the inner product of these two vectors is defined 

by 

n 

{hk,9k)H,=Y.^K,9l)- (19.5.7) 

i=l 

Since we are dealing with real vector fields note that A* is the adjoint 

with respect to the standard Euclidean inner product on R", i.e., it is 
the transpose of A, denoted . We could have just as easily developed 
the theory for vector fields on C". In that case A* would be the adjoint 
with respect to the usual inner product on C". Then the inner product 
(19.5.3) would be modified as follows: 

= p{d)q{xA^o- 

The complex case will be useful when we have imaginary eigenvalues 
and perform calculations using the complexification of the real case. 

19.5b The Main Theorems 

With this inner product in hand, we can now prove the main results that 
describe the structure of the normal form. 

Suppose hk, gk G Hk and A : R" — >■ R” is a linear invertible operator. 
Then 

{A~^gk {Ax),hk (x))Hk = {gk {Ax),A*~^hk {x))Hk, 

= {gk{x) ,A*~^ {hk{A*x)))Hk, using (19.5.6). 

(19.5.8) 

In (19.5.8) substitute A = e'^*. Then (19.5.8) becomes 

{e~'^*gk , hk {x))Hk = (hk (a;) , e~'^ %k , 



(19.5.9) 
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where we have used = e'^ Next we differentiate (19.5.9) with 

respect to t and evaluate the result at t = 0 to obtain 

{-Jgk{x) + Dgk{x)Jx, hu{x))Hk = {9k{x), -J*hk{x) + Dhk{x)J*x)Hk, 

(19.5.10) 

or, using the notation developed in (19.1.23) and (19.1.24), 

{[9k^ Jx] , {gk^i^kT J (19.5.11) 

or 

( 4 ") {9k),hk)H, = {9k, Lf} {hk))H„ (19.5.12) 

from which it follows that 




(k) 

But most importantly, this expression implies that if hk G Ker LjJ then 
hk is in the orthogonal complement of the image of L^j\ denoted Im L^j\ 
This is a key feature of the normal form, but before summarizing this in a 
theorem we want to develop a useful characterization of Ker LjJ . 

First, we argue that 

e^J**hk{x) = ■ (19.5.13) 

This can be seen as follows. Consider the following linear ordinary differ- 
ential equation on Hk' 

hk = hk, hk € Hk- 

It is a straightforward calculation to verify that 

mk{t) = e~'’**hk , 

rik{t) = e J* hk{x), 

are both solutions of this equation satisfying 

TOfc(O) = hk{x), 
rifc(O) = hk{x). 

Hence, by existence and uniqueness of solutions of linear ordinary differen- 
tial equations with constant coefficients, it follows that mk{t) = rifc(t) for 
all t. Therefore (19.5.13) holds. 

T (k) 

Now e J* acts as the identity map on KerLj. . Hence it follows that 
Ker = ^hk G Hk \e~'^ *hk ^x'j = hk(x), Vt G k| . (19.5.14) 

We can summarize these results in the following theorem. 
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Theorem 19.5.1 (Elphick-Tirapegui-Brachet-Coullet-Iooss) Hk can 

he decomposed as follows: 

Hk = ImL^P © KerL%\ 

where 

KerL^j} = ^hk G Hk \ e~'^ *hk = hk(x), Vi G k| . 

Applying this result to successive orders of the Taylor expansion of 
(19.5.1), we can immediately state a theorem characterizing what we mean 
by the normal form of (19.5.1) 

Theorem 19.5.2 (Elphick-Tirapegui-Brachet-Coullet-Iooss) The 

vector field (19.5.1) is said to be in normal form through order r — 1 if the 
nonlinear terms T{x) eommutes with * is the sense that 

= V^(x). (19.5.15) 

Or, equivalently, if T{x) satisfies the following partial differential equation 



DT{x)J*x - J*T{x) = 0. 



(19.5.16) 



Note that the partial differential equation (19.5.16) can be solved by the 
method of characteristics. Writing (19.5.16) out in components gives 

dT, 



Y. - Y. z = 1, . . . ,n. (19.5.17) 

j,i=i ^1 j=i 

The characteristic system associated with (19.5.17) is given by 

dJ-i 



dxj 



i, j = l,...,n. 



(19.5.18) 



X)/ Jlj^l Xi Jli^l 

It follows immediately that the characteristic curves in K" are given by 

x(f) = (19.5.19) 



This result is useful for giving another characterization of the nonlinear 
terms of the normal form. 



Theorem 19.5.3 (Elphick-Tirapegui-Brachet-Coullet-Iooss) The 

nonlinear terms of the normal form can be written in the following form: 

n 

j^{x) = Y {x)£ix, (19.5.20) 

where Li, i = 1, . . . ,n are linear operators commuting with J* in K" (to 
be explicitly constructed in the proof), and the scalar functions ai{x), i = 
1, . . . ,n, are rational functions that are first integrals of the characteristic 
system x = J*x. 
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Proof: First we construct the linear operators £i, . . . , To do this, we will 
assume that J* is in Jordan canonical form. Then it has r Jordan blocks, 
with each block J* corresponding to an invariant subspace of K”, denoted 
Ej, and to an eigenvalue Xj, j = 1, . . . ,r. Let denote the dimension of 
Ej and Pj denote the projection onto Ej. Then 

r 

To each Jordan block we associate Vj linear operators 

P„ ( j; - A, id) Pj , . . . , (j; - A, id)"^-' J = 1, . . . , r. 

This is a set of n linearly independent operators, with each commuting with 
J* , which we denote by Ej. Now we choose any x G M" having the property 
that it has a nonzero component in each of the Ej. Then EjX, j = 1, . . . , n 
forms a basis for M". We express E{x) in this basis to obtain (19.5.20). 
Next we examine the properties of the coefficients in this expansion, aj(x). 
Since 

j = l,...,n, 

it follows that 

e^'^Cj ^ j = l,...,n. (19.5.21) 

Now using (19.5.15), we obtain 

n 

g-j tjp *aj 

1=1 
n 

= E 6”“^ *EjX, 

1=1 

n 

— E! CjX, 

1=1 

n 

= ^{x) = ^aj{x)CjX. 

1=1 



Hence 

aj = aj{x). ( 19 . 5 . 22 ) 

Therefore the functions o.j{x) are first integrals of the characteristic system. 

□ 
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19.5c Symmetries of the Normal Form 

Suppose T : M" — >■ R" is a linear invertible operator. Consider an arbitrary 
vector field 

x = f{x), xGM". (19.5.23) 

We say that T is a symmetry of (19.5.23) if 

f{Tx) = Tf{x). (19.5.24) 

If (19.5.23) has an equilibrium point at a; = 0, and J is the matrix associ- 
ated with the linearization about this equilibrium, then by differentiating 
(19.5.24) we obtain 

JT = TJ. (19.5.25) 

The terminology “T commutes with the vector field” is also used. 

From our construction of the normal form we see immediately that the 
one-parameter group of transformations tcRjisa symmetry for 

the normal form. In this section we will prove that if the vector field has an 
additional symmetry, denoted by the linear invertible operator T : M" — >■ 
M", then the normal form as constructed above, also has the symmetry T. 
To begin with, we define the following linear operator on Hk- 

nhk{x) = T~^hk{Tx), (19.5.26) 

which will be useful for proving the following lemma. 

Lemma 19.5.4 The image of L^j\ ImL^j\ and the kernel of L^j\ 
KerL^j \ are invariant under T*. 

(k) 

Proof: First we show that T* commutes with Lf ' in the following sense: 

Lj ^ inhk{x)) = Lf'> {T-^hk{Tx)) = JT~^hk{Tx) - D {T-^hk{Tx)) Jx 

= JT~^hk{Tx) - T~^Dhk{Tx)TJx 
= T~^Jhk{Tx) - T~^Dhk{Tx)JTx 
= T~^ {Jhk{Tx) - Dhk{Tx)JTx) 

= {hk{Tx)) = {hk{x )) , 

(19.5.27) 



where in this calculation we have used 

JT = TJ ^T~^J = JT~^. 

The lemma follows directly from this calculation. □ 

Now assume that T is unitary, i.e., T* = T~^ . Then we have the following 
lemma. 
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Lemma 19.5.5 The image of L^j} , ImL^j} , and the kernel of L^j} , 
KerL^j} , are invariant under T*. 

(k) 

Proof: First we show that T, commutes with LyJ in the following sense: 



if} inhkix)) = LfJ {T-^hk{Tx)) 



(k) / rp—l 7 



= J*T-^hk{Tx) - D {T-^hk{Tx)) J*x 
= J*T-^hk{Tx) - T-^Dhk{Tx)TJ*x 
= T-'^rhkiTx) - T-^Dhk{Tx)J*Tx 
= T-i {J*hk{Tx) - Dhk{Tx)J*Tx) 

= {hk{Tx)) = nLfJ {hk{x)) , 



(19.5.28) 



where in this calculation we have used 



J*T = TJ* ^ T~^J* = J*T~ 



The lemma follows directly from this calculation. □ 

(k) (k) 

The two lemmas show that both Im Ly and Ker LyJ are invariant under 
T*. We will use these two facts to show that the normal form, as constructed 
above, commutes with T. 

First, suppose the vector field (19.5.1) has the symmetry T. Then we 
must have 



X = T-^JTx + T~^F2 {Tx) + T~^F^{Tx) + • • • , 

= Jx F F2 {x) + F^(x) + • • • . 

Now we consider simplifying the second order terms. Making the change of 
coordinates 

X ^ X + h 2 {x), 

gives 

X = Jx + (^ 2 ( 2 ;)) + F2{x) + 0 ( 3 ), 

or 



x = Jx + if {h 2 {x)) + rf“^5''V2(x) + F 2 (x) + 0(3), 



where 11^™ denotes the projection onto ImLj^^ and denotes 

(2) 

the projection onto Ker Lyf . 

We next apply the symmetry transformation to this equation, i.e., we let 
X I— >■ Tx, and act on the equation from the left with T~^ to obtain 



X = T-^JTx + T~^lP {h2{Tx)) + 
+ F2(Tx) + 0(3). 
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( 2 ) ( 2 ) 

Now KerLj, and Im Lj, are invariant under T*. Hence T* commutes with 
the projection operators, which allows us to rewrite this equation as 

X = Jx + n (/i2(x)) + +U^^^^?h-^F2(Tx) + 0 { 3 ). 

Since h 2 {x) can be chosen such that 

Lf {h2{x)) + U^^^'"'F2{x) = 0, 



it follows that 

x = Jx + T~^F 2 {Tx) + 0(3), 

or 

x = Jx + F 2 (x) + 0(3), 

Hence, through 0(2) terms, the normal form has the symmetry T . Clearly, 
this same argument can be continued at subsequent higher orders. In this 
way, we arrive at the following theorem. 

Theorem 19.5.6 (Elphick-Tirapegui-Brachet-Coullet-Iooss) Sup- 
pose T : M” — >■ K" is unitary, and the vector field (19.5.1) has the symme- 
try T. Then the normal form also has the symmetry T. 



19. 5d Examples 

Now we will give some examples of normal forms computed with this 
method. 



Example 19.5.1 (The “Double-Hopf” Bifurcation). 

We will compute the normal form for the so-called “double Hopf bifurcation” . 
By this we mean that the matrix associated with the linearization about an 
equilibrium has a pair of pure imaginary eigenvalues, Uloq, FiuJi, and we will 
assume that the matrix associated with the linearization is diagonalizable in a 
complex basis (semisimple). In this case the Jordan canonical form for the matrix 
is 



( iujo 0 0 0 \ 

0 —iuJo 0 0 

0 0 iu>i 0 

\ 0 0 0 —iu>i ) 



(19.5.29) 



with 



0 0 0 \ 

0 0 0 

0 0 0 

V 0 0 0 



(19.5.30) 



Since the first and second components are complex conjugates, as well as the 
third and fourth, we only need to be concerned with the first and third compo- 
nents of the normal form. 
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The component (i = 1, ... ,4) of a typical element of Hp+q+r+s is of the 
form 

gi{zo,zo,zi,zi) = z^z^z{zt. 

We want to determine the conditions on the nonnegative integers p, q, r, s such 
that the commutation relation holds, i.e., we must compute 

(e"‘^ *) gi{e-^ ^z) = gj{z), j = l,...,4, 

^ '' 3 A 

where 2 = {zq, zo, zi,zi). Since e~'' * is diagonal, and the first and second compo- 
nents of the normal form are complex conjugates, as well as the third and fourth, 
the computation reduces to 



(e gi{e-^ *z) = gi{z), i = l,3. 

V / i.i 



The first and third components of this commutation relation are easily found 
to be: 



First Component: 



+ p-q r-3 _ p -q r -s 

e ZqZqZiZi — ZqZqZiZi, 



(19.5.31) 



Third Component: 



-i((p-q)i^O + (r-s-l)uii)t p-q r-s _ p r -s 

e ZqZqZiZi — ZqZqZiZi- 



(19.5.32) 

These conditions give rise to the following relations between p, q, r, s: 

First Component: 

{p — q — l)oJo + (r — s)a;i = 0, (19.5.33) 

Third Component: 



(p — q)u)Q + {r — s — l)aii = 0. (19.5.34) 

There are two general situations to consider; the nonresonant and the resonant 
cases. 



Nonresonance: — Irrational 

Uli 

In this case, since wo and oj\ are independent over the rational numbers, (19.5.33) 
has the unique solution p = q + \ and r = s. From this it follows that the terms 
in the first component of the normal form have the form 

zo\zo\^'^\zi\^"' . 

Similarly, (19.5.34) has the unique solution p = q and r = s -f 1, and from this it 
follows that the terms in the third component of the normal form have the form 

zi\zo\‘^'^\zif"'. 

Thus the normal form for the nonresonant double Hopf bifurcation is of the form 
Zo = imzo + zoPo{\zof, \zif), 

ii = iuizi + 2iPi(|2:o|^, \zi\^), (19.5.35) 

where Po and Pi are polynomials in \zo\'^ and \zi\^. 
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Resonance: — Rational 

UJi 

Suppose ^ where m and n are integers with all common factors cancelled 

from the ratio (in this case we say that m and n are relatively prime or coprime, 
and write (m,n) — 1). In this case the solution to (19.5.33) is 

p = <7 + 1 + kn, r = s — km, fc G 2, 

which leads to terms of the form 



I i2gi i2s kn —km 
2oFo| |^:i| Zq 



(19.5.36) 



For this expression we must consider the cases k > 0 and k < 0. 



fc > 0: Using 



(19.5.36) can be rewritten as 



-1 _ Zl 

UiP’ 



I 1 2g I 1 2r kn —km 

2:o|2;o| ^\zi\ Zq Zl . 



A: < 0: Using 



(19.5.36) can be rewritten as 



Zo = Zq ^\zo\^, 



I i2(p— l)i i2s— — fcn —km 
2o|2:o| ^IZlI Zq Zl , 



-n — 1 mi |2pi |2s-— (/c + l)n — (fc + l)m 

Zq Zl \zo\ ‘^Izil Zq^ ^ ^ Zi^ ' . 



The solution to (19.5.34) is 



r = s + 1 + km, p = q — kn, fc G 2, 



which leads to terms of the form 



I i2gi i2s —kn km 
Zlizol ^|^l| Zq Zl . 



(19.5.37) 



For this expression we must consider the cases fc > 0 and fc < 0. 



fc > 0: Using 



(19.5.37) can be rewritten as 



-1 _ Zq 

“PoP’ 



I i2si i2p— fcn— fcm 

Zilzil |2:o| ^ 2:0 Zl . 



A: < 0: Using 



(19.5.36) can be rewritten as 



Zl = Zl 



I i2(r — 1)1 i2g —kn— — km 
Zi\zi\ 'l^ol ^Zq Zl 
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Hence, the normal form takes the following form 

• • I Tt n |2 I |2 n—m\ , — n— 1 m t* /i |2 i i2 — n m\ 

zo = lUoZo + ZoPo{\Zo\ ,\zi\ ,ZqZi )+Zq Zi Pl(|2o| ,|2l| , Zq Zi ), 

• • I /I |2 I 1 2 — n m\ , n— m — /i |2 i i2 n— m\ 

Zl = + 2 iQo(|2o| ,|2l| ,ZoZi)+ZoZi Ql(|2o| ,|2l| ,ZoZi), 

(19.5.38) 

where Pq, Pi, Qq and Qi are polynomials in their argnments. 

End of Example 19.5.1 



Example 19.5.2. We compute the normal form for a two-dimensional vector 
field with linear part given by 




We will compute the normal form by solving the characteristic system (19.5.18), 
which for this example is given by 



X 



dPi 

dy 



= 0 , 



dFh 

' dy 






The solution is given by 



Fi{x,y) = x(f)i{x), 

P 2 {x,y) = y<j}i{x) + cj> 2 {x), (19.5.39) 

where (j>i and <^2 are polynomials in x (this is proved in Elphick et al. [1987], but 
it can easily be verified by substitution). Hence, KerL^i^ is two dimensional, and 
a general vector in this space is given by 

/ k k—1 I T k\ 

I ax , ayx -I- oa; I . 

Hence, the normal form is given by 

x = y + Po{x), 

ij — xyPi{x) -(- x^p 2 {x), (19.5.40) 

where Pi(x), i = 0, 1, 2, are polynomials in x. 

The normal form can be simplified if we add to Ker the vector 

( k , fc-i \ 

( —ax , kax y\ , 

(k^ 

which is orthogonal to KerLj,'. This amounts to choosing a different space com- 
(k) 

plementary to Im LjJ . The resulting complementary space has the form 

^0, ayx^~^ -b , 
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and the normal form is given by 

x = y, 

y = xyP\{x) + x^P^ix), (19.5.41) 

where Pi(x) and P 2 (x) are polynomials in x. 



End of Example 19.5.2 



19. 5e The Normal Form of a Vector Field 
Depending on Parameters 



Essentially the same theory goes through for vector fields depending on 
parameters, but with a few interesting twists. 

We rewrite the vector field (19.5.1), where T is 0(r — 1) in x: 





X = Jx-\- T{x, fi), X € M", ^ G 


(19.5.42) 


with J in 


Jordan canonical form, and not depending on /r, and 






T{0,tx)=O(\^x\), D,T{0,0) = 0. 




Theorem 19.5.7 The vector field (19.5.42) can he transformed to a nor- 
mal form in which lF{x,fx) satisfies 




e = T{x,y), 


(19.5.43) 


and 


T{Q,^x) G Ker.r, 


(19.5.44) 




rD,T{D,yL) - D^T{D,pi)r = 0. 


(19.5.45) 



Proof: The basic idea here, as we mentioned earlier, is that we Taylor 
expand in x and view the coefficients as functions of the parameters /i. In 
this way, the previous theory goes through in exactly the same way and 
the parameters “just go along for the ride” . However, two new terms do 
arise in this way, but they cause little difficulty as we will now see. 

Taylor expanding (19.5.42) in x gives 

x = Jx + fx) + fi)x + 0(\x\^). (19.5.46) 

As in the situation with no parameters, we choose the (|a;|^) terms to be in 
Ker L^j} ■ In the case with parameters there are two new terms; T(0, fx) G Hq 
(constant terms) and DxJ^{0,fx) G Hi (linear terms). Now 

(JG(0, fx)) = m) = 0 ^ T(0, fx) G Ker J*, 
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and 

L^j} p,.F(0,/i)) ^ rD,T{0,^i) - D,T(0,fx)r = 0. 

Requiring the 0{\x\p) terms to be in Ker L^j} , p>2 immediately leads to 

e~J tj: = T{x,p). 

This completes the proof of the theorem. □ 



Example 19.5.3. 

We consider the example of the double-zero eigenvalue with non-semisimple 
linear part 




with the vector field in the form 



\y) \y) ^ T 2 y{Q,y) ) 



+0{\x\\\yf). 



Then we have 

/0 0\ (T,{Q,y)\ 

\1Q) \T2{Q,p)) 

and 

/"0 0\ / ^1.(0, /i) ^1^(0, 

ylOy \J^2x{^, y) ^2y{^, y) ) 



f ° 

V^i( 0 ,m) 



^ = 0 , 



/^lx(0,/l)^ly(0,y)\ foo) ^ foo) 

[^2x(0,M)J^2y(0,y)J[l0j [ooj 



^ :Fly(0, /J.) = 0, J^lx(0, y) = J^2y(0, y). 

Then the vector field takes the form 



X = y + J^ix(0,f^)x, 

y = :F 2 { 0 , y) + T 2 x{Q, y)x + Tix{0, y)y + 0 ( 12 : 1 ^, |y|^)- 



End of Example 19.5.3 

We make some final remarks concerning extensions of the Elphick- 
Tirapegui-Brachet-Coullet-Iooss approach to normal forsm. 

Normal Form for a Vector Field Near a Periodic Orbit 

looss [1988] has extended the work of Elphick et al. [1987] to the situation 
of computing the normal form of a vector field in the neighborhood of a 
periodic orbit. 

Normal Form for a Map Near a Fixed Point 

Chen and Della Dora [1999] have extended the work of Elphick et al. [1987] 
to the situation of computing the normal form of a map near a fixed point. 
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19.6 Exercises 



1. Prove that 

{p{Ax),q{x)) = (p{x),q{A*x)). 

(Hint: Apply the chain rule and show that dx — AOy where y — A*x.) 

2. Prove that 



{A (Ax) ,hk {x))h^ = {gk (x) , A* {hk {A'x)))Hk- 



3. Prove that (•, •)h^ is an inner product on 

4. Let {ei}2=i denote the standard basis on and let (xi, . . . , Xn) denote coordinates 
with respect to this basis. We denote two arbitrary elements of by 



i = l m^H \-mn=k 






9k=Yl E 9l:, 

i = l mi H \-mn=k 



X-, Xry ■ ■ ■ X„ ei 



Compute the inner product of hk and gk- 

5. Prove the following result from Elphick et al. [1987]. If J is diagonalizable, then 

KerL^j} ^ Kev , 



and the normal form can be constructed so that it commutes with t G K. 

6. Consider a two-dimensional vector field having an equilibrium point at the origin with 
the linear part of the vector field given by 



J = 




Suppose also that the vector field commutes with the linear map defined by the fol- 
lowing matrix 




Compute the normal form. 

7. Consider a three-dimensional vector field having an equilibrium point at the origin 
with the linear part of the vector field given by 



/ 0 oj 0\ 

J ^ -oj 0 0 , 

\ 0 0 0 / 

where u; > 0 is a real number. Suppose also that the vector field commutes with the 
linear map defined by the following matrix 

/ 1 0 0 

T = 0 1 0 

\0 0 -1 



Compute the normal form. 
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8. Consider the non-semisimple double-Hopf bifurcation at 1 : 1 resonance, i.e., a four- 
dimensional vector field having an equilibrium point at the origin with the linear part 
of the vector field, with respect to a complex basis, is given by 

/ iuj 1 0 \ 

j _( 0 iuj 0 0 I 

^ “ 1 0 0 -iuj 1 I ’ 

V 0 0 0 —iio / 



where a; > 0 is a real number. Show that in complex coordinates the normal form can 
be written as follows 

zi — iujz\ 22 , 

Z 2 = iUJZQ. + Z101 (|zi|, ZlZ2 - Z 1 Z 2 ) + Z202 (|zi|, Z 1 Z 2 - Z 1 Z 2 ) , 

where and <p 2 are polynomials in their two arguments. 

9. Recall the example of the non-semisimple double-zero eigenvalue depending on pa- 
rameters. We showed that the normal form could be transformed into the following 
form 



X = y + Fix(0, y.)x, 

V = ^ 2 ( 0 , /z) + J^2x{0, y.)x + Fix(0, y.)v + +0{\x\^ , \v[‘)- 



“Re-parametrize” by letting 



JF2(0, ^ , 

^la;(0, ^ ^2, 

^2x(0, fJ.) ^ fJ. 3 . 



Show that the fj, 2 X and fisx term cans be eliminated by appropriate linear transfor- 
mations. 

10. Consider the “double-Hopf bifurcation” as described above, both the resonant and non- 
resonant cases. Suppose the associated vector field depends on parameters. Compute 
the terms (19.5.44) and (19.5.45). 

11. Consider a two-dimensional vector field having an equilibrium point at the origin with 
the linear part of the vector field given by 



J = 




Suppose also that the vector field commutes with the linear map defined by the fol- 
lowing matrix 




Now consider the situation where the vector field depends on parameters. Compute 
the terms (19.5.44) and (19.5.45). 

12. Consider a three-dimensional vector field having an equilibrium point at the origin 
with the linear part of the vector field given by 

/ 0 oj 0\ 

J = -oj 0 0 , 

\ 0 0 0 / 



where u; > 0 is a real number. Suppose that the vector field depends on parameters. 
Compute the terms (19.5.44) and (19.5.45). 
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Suppose also that the vector field commutes with the linear map defined by the fol- 
lowing matrix 

/ 1 0 0 
T = 0 1 0 

\0 0 -1 



Compute the terms (19.5.44) and (19.5.45). 

13. Consider the non-semisimple double-Hopf bifurcation at 1 : 1 resonance, i.e., a four- 
dimensional vector field having an equilibrium point at the origin with the linear part 
of the vector field, with respect to a complex basis, is given by 

( icj 1 0 ^ \ 

0 za; 0 0 1 

0 0 -iio 1 

0 0 0 



where cj > 0 is a real number. Suppose that the vector field depends on parameters 
and compute the terms (19.5.44) and (19.5.45). 

14. Develop the Elphick-Tirapegui-Brachet-Coullet-Iooss normal form for maps. 



19.7 Lie Groups, Lie Group Actions, and 
Symmetries 

In this section we develop some of the terminology and tools to discuss 
issues related to symmetries in dynamical systems, which we will visit from 
time-to-time throughout the rest of this book. We begin by recalling the 
definition of a group. 

Definition 19 . 7.1 (Group) A group is a set, G, equipped with a binary 
operation on the group elements, denoted and referred to as group 
multiplication, which satisfies the following three properties. 

1 . G is closed under group multiplication, i.e., g\, g2 € G ^ 91*92 G G. 

2 . There exists a multiplicative identity element in G, i.e., there exists 
an element e € G such that €*9 = 9*9 = 9, for any 9 € G. 

3 . For every element of G there exists a multiplicative inverse, i.e., for 
every 9 € G, there exists an element, denoted 9~^, such that 9*g~^ = 
9 ~^ *9 = e. 

4. Multiplication is associative, i.e., {91 *92) *93 = 9i* {92 *93), for any 
9 i, 92 , 93 G G. 

Most of the groups which we encounter will be Lie groups. Roughly 
speaking, a Lie group is a group that also has the structure of a differen- 
tiable manifold. However, rather than developing the necessary machinery 
for the theory of differentiable manifolds, we will introduce the notion of 
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Lie groups from a more elementary point of view (following Golubitsky et 
al. [1988]) that will be more than adequate for our purposes. 

Let GL(M") denote the group of linear, invertible transformations of M” 
into M", which we can view as the group of nonsingular nxn matrices over 
M. Then we have the following definition. 

Definition 19.7.2 (Lie Group) A Lie group is a closed subgroup of 
GLfMA), which we will denote by L. 

Recall that a subgroup is a subset of a group, which obeys the same 
axioms as a group (but with the important point that it is the subset that 
is closed with respect to group multiplication). The term “closed” in the 
definition of a Lie group needs clarification. We can identify the space of 
all n X n matrices with i?" . Then GL(R") is an open subset of i?" . T is 
said to be a closed subgroup if it is a closed subset of GL(K”), as well as 
a subgroup of GL(K”). If this closed subset is compact or connected then 
the associated Lie group is also said to be compact or connected. 

In the context of symmetry properties of dynamical systems, we will be 
concerned with transformations of the phase space (and, in some situations, 
parameter space) under a Lie group. This brings us to the idea of a group 
action. 

Definition 19.7.3 (Lie Group Action on a Vector Space) LetV de- 
note a Lie group and V a vector space. We say that T acts linearly on V 
if there is a continuous mapping, referred to as the action.- 

r X V ^ V, 

(7,u) H> 7- u, 

such that 

1. For each 7 G T the mapping 

-.V^V, 

V ^ ^ ■ V = Pj{v) . 



is linear. 

2. (a) For any 71, 72 G T, 

7 i . (72 • v) = (71 * 72) • V. 



(b) e ■ V = V. 

Closely related to the notion of a Lie group action on a vector space is 
the notion of a representation of L with respect to a vector space V. Let 
GL(V) denote the group of invertible linear transformations of V into V. 
Then we have the following definition. 




308 



19. Normal Forms 



Definition 19.7.4 (Representation of F on V) The map 

p : V^GL{V), 

1 ^ Pj, 

is called a representation ofT on V. 

A group is an abstract object. A representation of a group gives rise to 
a more concrete manifestation of the group in terms of specific types of 
transformations on a vector space. This enables us to be able to perform 
calculations for a specific representation. This is (loosely) analogous to the 
situation with a linear map on a vector space. The map itself is an abstract 
object. However, in order to perform certain calculations we often have to 
choose a basis in order to obtain a representation for the map. The map 
has an abstract existence that is independent of any choice of basis for the 
vector space on which it acts. Similarly, a given group can have different 
representations . 



19.7a Examples of Lie Groups 

We now give some examples of concrete Lie groups. Following our discus- 
sion above, note that in some of the examples we can define the Lie group 
in a rather abstract way. Yet when we mathematically describe a Lie group 
in terms of the way in which it transforms a vector space (e.g., such as rota- 
tions, reflections, etc.) we typically resort to writing down formulae, which 
is equivalent to choosing a specific representation. Often this (seemingly) 
subtle point is of little consequence in elementary applications of these 
ideas. However, the appropriate representation for a group of symmetries 
can greatly simplify computations in specific problems. 

0{n): The n-dimensional Orthogonal Group. This is the set of n x n 

matrices. A, satisfying 

AA'^ = id. 



SO{n): The Special Orthogonal Group. This is the set of matrices 
A e 0{n) such that det A= 1. 

Zn'. The Cyclic Group of Order n. ^ This is the set of rotational sym- 
metries of a regular n-sided polygon. It consists of rotations of the 
plane through the angles 

O-TT 

0, 6,26, {n- 1)0, 6=—. 

n 



^Recall that the order of a finite group is the number of elements in the group. 
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Hence, it can be identified with the set of 2 x 2 matrices generated ^ 
by 

= 

Dn- The Dihedral Group of Order 2n. This is the set of all symme- 
tries of a regular n-sided polygon. It consists of rotations of the plane 
through the angles 

27T 

0, 6», 20, ..., (n- 1)6», 0=—, 

n 




as well reflection in the x-axis (which is referred to as a “flip”). 



It can be identified with the set of 2 x 2 matrices generated by i? 2 ^ 
along with the flip 



1 0 
0 -1 



Zn is a subgroup of D„. 

U (n): The n-dimensional Unitary Group. This is the set of n x n ma- 
trices, A, satisfying 

AA* = id. 

SU{n): The Special Unitary Group. This is the set of matrices A G 
U{n) such that detH = 1. 

Sp{2n): The symplectic Group. Let denote a nondegenerate, skew- 
symmetric, bilinear form on The set of linear transformations 
A : which preserve 12, i.e., 

Q.{Au, Av) = 12('u, v), Vu, V G 

forms a group, which is called the Symplectic group. 

T": The n-dimensional torus. The n dimensional torus 

T" = S'! X • • • X 

' V- ' 

n times 



^Recall that a collection of elements is said to generate a group if all elements 
of the group can be expressed in terms of those elements. 
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can be viewed as a Lie group by identifying 0 G T" with the matrix 



/Rg, 0 0 ••• 0 \ 

0 Rg^ 0 • ■ • 0 
0 0 i?03 ■ ■ • 0 



V 0 



0 0 



■■■ReJ 



which is an element of GL(K^"). 



19.7b Examples of Lie Group Actions on Vector 
Spaces 



Now we consider some examples of group actions. 



SO{2) acting on We identify SO{2) with the one- parameter family 
of matrices 



Rg 



f cos 0 — sin d \ 
sin 9 cos 9 J ’ 



0 < 6» < 27t, 



and let it act on points in via standard matrix multiplication. 

Equivalently, we identify points (x, y) G with complex numbers 
z = x + iy & <C. We can also identify a point Rg G 5'0(2) with a point 
9 in the circle group, 5”^. Then we can view acting on C as follows: 

9-z = e^^z. 



0(2) acting on The same as above, however we append to the ma- 
trices Rg the flip 

1 O' 

0 -1 



In the complex setting, we append to the flip, which is defined by 
the complex conjugation operation 



S'0(2) acting on We define an SO{2) action on by allowing ma- 
trices of the form 

( cos 0 — sin 0 0 \ 
sin 9 cos 6* 0 I , 0 < 0 < 27 t, 

0 0 l) 



to act on elements of via the usual matrix multiplication. 
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Equivalently, we can identify with C x M and define a action 
on C X K by 

9 ■ {z, x) = x). 

SO{2) acting on We express this group action in complex coordi- 
nates by identifying with C^. For a group element </> G SO(2), we 
define an action on C^: 

(j)- {zi,Z2) = (e*'^2i,e*‘^Z2)) • 



0(2) acting on M"*. We express this group action in complex coordinates 
by identifying with C^. Recall that 0(2) is the same as SO(2), 
together with a flip. We define the following action of 0(2) on C^: 

0 • (zi,Z2) = (e'^zi, e*'^Z2)) , (<l> G 50(2)), 

K ■ (zi,Z2) = (zi,Z2), (k = flip in 0(2)). 



Dn acting on We identify with C in the usual way. Then an action 
of Dn on C is given by 



9-z = e^^z, (9 



— ), and K ■ z = z. 
n 



Zn acting on We identify with C in the usual way. Then an action 
of Dn on C is given by 



2 = 




action on C^. We define a two-torus action on as follows: 
(9,<P) ■ (zi,Z2) = (D^Zue^^Z2), (9,<j>) G T^. 



Dn X acting on We express this group action in complex coordi- 
nates by identifying with C^. 



1 ■ (zi,Z2) = {e''^Zi,e''^Z2) , (7 G Z„) 

K - (21,2:2) = (21,22), 

6 »- (21,22) = (e*^2i, 6*^22) , ( 9 gS ^), 

where 

QiTt 

j = 0 , (f), 2((>, . . . , (n - l)(j), (j )= — , 

n 
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0 ( 2 ) X acting on We express this group action in complex coor- 
dinates by identifying with C^. Using the action of 0 ( 2 ) on as 
defined above, we define the following action of 0(2) x 5”^ on C^: 

^ • (zi,Z2) = e*^Z 2 )) , (9 G S^), 

0 • (zi,Z2) = (e'^zi, e*-^Z 2 )) , G SO( 2 )), 

K ■ (zi,Z2) = (zi,Z2), (k = flip in 0(2)). 

Z 2 © Z 2 acting on The group Z 2 © Z 2 has four elements (e, S), where 
e = ±1, 6 = ±1. We define a group action on K^. For any (x, y) G 
the group element (e, S) acts on it as follows: 

(e,i5) • (x,y) = (ex,6y). 

19.7c Symmetric Dynamical Systems 

Now we can give the definitions that specify precisely what we mean by a 
symmetric dynamical system. 

Definition 19.7.5 (F-Equivariant Map) Now let f : V ^ V denote a 
mapping of a vector space V into V . Let F denote a compact Lie group 
with a specified action on V. We say that f is F-equi variant with respect 
to this action if 

f(jx)=jf(x), V7 GF, xGU. 

If / is a vector field then we will refer to it as a F- equi variant vector field. 
It follows that if x(t) is a solution of a F-equivariant vector field x = f(x), 
then jx(t) is also a solution, for each 7 G F. 

Definition 19.7.6 (F-Invariant Function) Now let f : U — K denote 
a mapping of a vector space V into M. Let F denote a compact Lie group 
with a specified action on V . We say that f is F-invariant with respect to 
this action if 

f(ix) = f(x), V7 G F, a; G U. 

19.8 Exercises 



1 . Prove that the transformations 

(xi,yi,X2,y2) {-xi, -yi, X2,y2) ->■ (xi, yi, -X2, -y2) 

-> {xi,yi, X2,y2) ->■ (x2,y2,xi,yi), 



define a D4 action on 
2 . Prove that the transformations 

{xi,yi,x2.y2) i-xi,-yi,x2,y2) {xi.yi, -X2. -y2), 



define a Z2 ® Z2 action on . 
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3 . Prove that the transformations 

(^1 , 22 ) (cos 0zi + sin 0Z2, — sin 9zi + cos OZ 2 ), 0 G [0, 27t). 

define an SO( 2 ) action on C^. 

4 . Prove that the following transformations 

21-^2, 

2 I— >• 22, 

define a D4 action on C. 

5 . The following exercise comes from Sethna and Feng [ 1991 ]. Consider the following 
vector field on C X C: 

Zj — iuJZj + aj \zi 1^21 + bjz\z2 

+Cj|2i|^Z2 + djZlkzP + fijZlzl + /j|z2|^Z2,i = 1.2, 

where a^, bj, Cj, dj, Cj, fj are complex coefficients, where Zj = Xj iyj. Derive con- 
ditions on the coefficients under which the vector field is equivariant with respect to 
the following group actions. 

(a) D4: 

{xi,yi,x2,y2) {-Xi,-yi,x2,y2) -t (a^i, yi, -a^ 2 , -^2) 

->• {xi,yi,X2,y2) -t (a^ 2 , 2 / 2 , ail, yi). 

(b) Z2 0 Z2 '■ 

(ail, yi, ai2, y2) ^ (-aii, -yi, ai2, y2) ^ (aii, yi, -ai2, -y2)- 

(c) 0 ( 2 ): 

(- 21 , 22 ) (cos 9zi + sin 9z2 , — sin 9zi -\- cos ^ 22 ) , ^ G [0, 27t) , 

(21, 22) 1-^ (21, 22) 

6 . Consider a vector field having an equilibrium point at the origin with the (real) Jordan 
canonical form of the matrix associated with the linear part given by 

( 0 —LJi 0 0 \ 

oji 0 0 0 I 

0 0 0 —LU2 I ’ 

0 0 LJ2 0 / 

where uji, CJ2 > 0 . 

(a) Suppose muji + nuj2 ^ 0 for any nonzero integers n and m. Prove that the 
normal form, computed up to any finite order, is equivariant with respect to T^. 

(b) Suppose mcui + nuj2 — 0 for some nonzero integers n and m, with n and m 
relatively prime. Prove that the normal form, computed up to any finite order, 
is equivariant with respect to 

7 . Consider a vector field having an equilibrium point at the origin where the matrix 
associated with the linear part is given by 




Suppose the vector field is equivariant with respect to a D2 action. Compute the 
normal form through third order terms. 
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8. Consider a vector field having an equilibrium point at the origin where the matrix 
associated with the linear part is given by 



( 




a; > 0. 



Transforming to complex coordinates, consider the following action on C: 

2 I— >• 2, 

2 1 -^ iz. 



Compute a normal form for the vector field that is equivariant with respect to this 
group action. 

9. Prove that: 

0 ■ (zi, Z 2 ) = e*'’z2)) , (6 G S^), 

4> ■ (zi, Z 2 ) = (e“*'^zi, e*'^Z2)) , (4> G SO(2)), 

K ■ (zi , Z 2 ) = (Z2, Zi), (k = flip in 0(2)), 



defines an 0(2) X action on C^. (This result is due to van Gils [1984], see also 
Golubitsky et al. [1988]. Hint: Use the coordinates 22 ( 1 , i) + 2 i(l,— z).) 

10. Consider an autonomous vector field on having an equilibrium point at the origin 
where the matrix associated with the linearization is given by 

( 0 —LOi 0 0 

o;i 0 0 0 

0 0 0 —u)2 

0 0 u;2 0 




Compute a normal form, up to some finite order, that is equivariant with respect to 
0(2) X . (Note: two 0(2) X actions were described above. Consider each.) 



19.9 Normal Form Coefficients 

Up to now we have only been concerned with calculating the form of the 
nonlinear terms in the normal form. However, in applications once will 
need to know the coefficents on each nonlinear term in the normal form as 
a function of the Taylor coefficients of the original vector field. Fortunately, 
this is the type of problem that only needs to be once. Below we summarize 
some of the known results for the normal forms of autonomous vector fields 
near non-hyperbolic equilibria. 

The Non- Semisimple Double Zero Eigenvalue: 

The following results are due to Knobloch [1986a]. Consider a two- dimen- 
sional autonomous vector field having an equilibrium point at the origin 
where the Jordan canonical form of the matrix associated with the lin- 
earization is given by 

. / 01 \ 
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Taylor expanding about the origin, the vector field has the following general 
form 



tti + 6i X y + Cl 
+ 62 X y + C2 



di + Cl y + f I X y'^ + gi y^ 
d,2 x^ + €2 x'^ y + f2 X y'^ + 52 y^ 



0(4) 



We know that through a sequence of nonlinear coordinate transformations 
the vector field can be transformed into the following form 



0 

A u'^ + B u V 



0 

Cu^ + Du^ 



0(4). 



Knobloch [1986a] has computed the coefficients of the normal form in terms 
of the original Taylor coefficients of the vector field. These are summarized 
in the following table where it is assumed that the coefficients C and D are 
nonzero, unless otherwise indicated. 



Case Conditions 


A 


B 


c 


D 


lA 


A yf 0,R yf 0 


02 


2ai + 62 


d2 + hia2 — a\h2 


0 


2A 


A yf 0,R = 0 


02 


0 


d,2 b±a2 2a^ 


0 


3A 


A = 0,R yf 0 


0 


2ai + 62 


d2 — ai&2 


62 -l- 3di — 01C2 












+ b2{bi + C2)/2 


4A 


A = 0,5 = 0 


0 


0 


C?2 “1“ 2(3,2 


62 J- 3di 












— 2a\C2 — 0161 



For the next case the expressions for the normal form coefficients are so 
long that we do not reproduce them here. Rather, we refer the reader to 
the appropriate papers in the published literature. 

A Zero and a Pair of Pure Imaginary Eigenvalues: 

Consider a three- dimensional autonomous vector field having an equilib- 
rium point at the origin where the real Jordan canonical form of the matrix 
associated with the linearization is given by 

/O -w 0\ 

J=jajOoj, u;>0. 

\0 0 0 / 



Coefficients for the normal form can be found in Wittenberg and Holmes 
[1997]. 
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A Pair of Pure Imaginary Pairs of Eigenvalues: 



Consider a four- dimensional autonomous vector field having an equilibrium 
point at the origin where the real Jordan canonical form of the matrix 
associated with the linearization is given by 



/ 0 -wi 0 0 \ 

0 0 0 

0 0 0 -W2 

\ 0 0 W2 0 / 



> 0, W2 > 0. 



For the nonresonant case defined by muji + nuj 2 yf 0, 0 < \m\ -I- |n| < 4 
normal form coefficients have been calculated by Knobloch [1986b]. For the 
case of 1 : 1 resonance in the non-semisimple case normal form coefficients 
have been calculated by Namachchivaya et al. [1994]. 



19.10 Hamiltonian Normal Forms 

In this section we want to describe the procedure for transforming a Hamil- 
tonian vector field into normal form in the neighborhood of a fixed point. 
In particular, most of our attention will be focussed on elliptic fixed points, 
i.e., fixed points having the property that the eigenvalues of the matrix 
associated with the linearization are purely imaginary, with nonzero imag- 
inary parts. This is a very old subject where the original results are often 
attributed to Birkhoff [1966] (for the nonresonant situation) and Gustavson 
[1966] (for the resonant situation). Many books and papers contain expo- 
sitions of the theory of Hamiltonian normal forms in one form or another, 
and with varying degrees of completeness. See, e.g., Abraham and Marsden 
[1978], Guillemin and Sternberg [1984], Meyer and Hall [1992], Arnold et al. 
[1988], Sanders and Verhulst [1985], and Saenz et al. [1986]. Our exposition 
follows that given in Ghurchill et al. [1983]. 



19.10a General Theory 

Our calculations will be greatly simplified if we use the form of Hamilton’s 
equations in complex variables given in Ghapter 14. We begin by developing 
some notation. Let Vr denote the set of real valued homogeneous polyno- 
mials of degree r > 2 in the complex variables Zj = xj + iyj, Zj = Xj — iyj, 
j = 1, • • • , n. We want to consider a formal power series of the following 
form 

H = H2 + Hs + --- + H,^ + ---, Hr€Vr- (19.10.1) 

We denote the space of formal power series by P and, as a result of the 
form of (19.10.1), we use the notation 

V = (Bf°=2'Pr, 
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to denote the space of formal power series under consideration. This rather 
abstract notation will allow us to very succinctly discuss certain algebraic 
structures that would be very cumbersome to express in coordinates. 

However, keeping with the subject matter of this section, you can think 
of H as a real Hamiltonian expressed in complex variables and having 
the value zero at the origin, with the corresponding Hamiltonian vector 
field having a fixed point at the origin. Issues such as convergence and 
differentiability will not play a role in the formal algebraic manipulations, 
so we will address these later on. 

For a fixed F G V we define the linear map 



! F — )■ 7^, 

H ^ ndpiB) = [F, H], (19.10.2) 



for any H G V, where 



[H,G] 




dH dG 
dzj dzj 



dH dG\ 
dzj dzj) ’ 



(19.10.3) 



d_ _l f _ ,jr\ .d_\ 

dzj 2 \ dxj * dyj ) ’ dzj 2 \ dxj * dyj ) 

Note that 

ndF{H) = -ndniF). (19.10.4) 



[•, •] is an example of a Lie bracket] note that it is the negative of the usual 
Poisson bracket of two functions, i.e., [•,•] = —{•,•}. It is easy to verify 
that for F G V 2 we have 



adplp^ : Fr — >■ Fr- 



We also define 

ad^ = identity mapping onP, 
ad^ ^ ad p 
and, inductively. 



(19.10.5) 



addp = adp ° s^d^p j > 1. 

For F G Fs, Hr G Fr, using (19.10.2) and (19.10.3) we can show 

adp{Hr) G Fr+s-2- (19.10.6) 

Using this relation, we can inductively verify that 

ad^p{Hr) G Fr+j(s-2)- (19.10.7) 
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The exponential map will also play an important role for Hamiltonian nor- 
mal forms. For K G Vs define 

exp(adK) : V ^V 

OO . 

H !->■ ^ —adj^(H) = exp (ad/f)(H)) . (19.10.8) 

i=o 

We now define what we mean by the term normal form of a Hamiltonian. 

Definition 19.10.1 (Normal Form) An element H = & V is 

said to be in normal form through terms of order to > 2 with respect to 
F gV if adpiHr) = 0 for 2 < r < to. 

The following definition is important for the procedure of computing the 
normal form. 

Definition 19.10.2 (Splitting) F GVr is said to split V if for r >2 we 

have 

Vr — 0 Fr 

where Nj. = Ker{adF\vr) Rr = Im{adF\'Pr)- When F splits V then 
<idF\Rr. is an isomorphism and we let Tr : Rr ^ Rr denote the inverse. 

The following is the main normal form theorem and provides an algo- 
rithm for transforming a Hamiltonian into normal form order- by-order. 

Theorem 19.10.3 Let H = GV bein normal form through terms 

of order (to — 1) > 2 with respect to H 2 , and assume that H 2 splits V. 
Let Hm = Hm + Hm, where Hm G Nm and Flm G Rm, and set Km = 
Fm (^Hm^ ■ Then exp (adKm) (H) is in normal form through terms of order 
TO with respect to H 2 , it agrees with H through terms of order m — 1, and 
it has Hm as the term. 

Proof: From (19.10.8) we have 

OO .. 

exp (ad/f„) {H) = ^ -^JH) 

j=0 

= H + adK^{H) + {terms 'mVj, j > 2m — 2} 



— H 2 + i?3 Hm-1 + Hm + adif^ {H 2 ) 

+ {terms inP^, j > to -I- 1} , (19.10.9) 



which one easily sees agrees with H through terms of order to — 1. 
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Next we want to show that (19.10.9) is in normal form through order m 
with respect to i? 2 - For this to be true, by definition 19.10.1 we must have 

ad//2 {Hm + ddKrn{H2)) = 0. 

This can be verified through the following simple calculations: 

adi/2 i.Hm + adK^(i/2)) = + ad//^ {ddK^(,H2)) 

ad//2(F^yn) ad//2 (ad//2(.^m)) 

— ad//2 ad//2(.ffm)) 

ad i /2 ad/i 2 (ad^^ 

— adi^2 (FFyn 

= adii2(iF„) = 0, (19.10.10) 

since FF^ G fVm. This calculation also reveals that the order m term in the 
normal form is H^. More precisely, the order m term of (19.10.9) is given 
by 

FFtti “t“ adi^^(FF2) ad/i2 (Ffj,^) 

= Hm-H^ = Hm. (19.10.11) 

The theorem is now proved. □ 

The following results show how the normalization transformations are 
related to symplectic or canonical transformations. 

Proposition 19.10.4 Let H{z,z) = converge in some 

neighborhood U of the origin in R^". Assume that H, considered as an 
element ofV, is in normal form with respect to H 2 through terms of order 
(m — 1) > 2. Denote = Hm + H^, where Hrn G and Hm G Rm, set 
Km = Lm{Hm), and let (ft denote the flow generated by the Hamiltonian 
vector field z = —2i . Then 

1. There is a neighborhood V CU of the origin such that (ft is defined 
in V for all |t| < 2; and 

2. exp{adKra){H) = H o (f\. 

We remark that coordinate transformations generated by the time-one 
flow map of the solutions of Hamilton’s equations are often referred to as 
Lie transforms. 

Proof: 

1. The origin is an equilibrium point, hence the flow exists at this point 
for all t. Since flows have open domains, there is an open set contain- 
ing the origin such that the flow is defined for |t| < 2. 
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2. Let F{t) = H o (fit- Taylor expanding F{t) about t = 0 gives 

F(t) = F(0) + F'(0)t+ ^F”(0)t^ + • • • + + • • • . (19.10.12) 

2! n! 

Now we want to evaluate the Taylor coefficients. Recall that 
adK^H) = [Km, H]=- {Km, H} = [H, Km} , 

hence, from the formula for the time evolution of a function under 
the flow generated by a Hamiltonian vector field (see Chapter 14), 
we have 

P'{t) = ^ ° = ad/f^(iL) o (j)t. 

Repeatedly differentiating this expression, we find 
F"(t) = ad^^(iL)o</., 

F(")(t) = ad^^(iL)o<^, 



and substituting these expressions into (19.10.12) gives 
i^(l) = H o = 11 + adic„, (H) 

+ ^ad^^(iL) H + — ad^^(iL) H 

CXD . 

= — ad^^(iL) = exp(adic^) (iL), (19. 10. 13) 

j=o 



which proves the result. □ 

Corollary 19.10.5 exp{adK^) is a symplectic transformation. 

Proof: This follows immediately from Theorem 14.3.6 in Chapter 14 since 
exp(ad/fm)(^f) = H o (pi j and (pi is the time-one map obtained from a 
Hamiltonian flow. □ 
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An Example Calculation: Normalization Through Terms of 
0(3) 

We will now illustrate the use of Theorem 19.10.3 by beginning with a 
Hamiltonian of the form 

H = H2 + H3 + + • • • + Hjn + • • • , 

and normalizing it through terms of 0(3). From Theorem 19.10.3 we have 

exp (ad/fg) {H) = H + 

+ {terms inVj,j > 5} 



— H2 + H3 + H3 + B,dK^{H2) 

+ Hi + adxsiH^) + “^^^ 3 (^ 2 ) +0(5) 

' V " 

0(4) terms 



(19.10.14) 



Now 



ndn^iKs) = ad/42 (ad^^(Ff3)) = H 3 = -adK3(H2), 
from which it also follows that 

ad^3(i?2) = -ad^3 



(+)• 



Using these to relations, (19.10.14) can be written as 

exp (ndKaiH)) = i ?2 + ^3 + ^^4 + adiC3(i7s) - ^a,dK 3 (H 3 ) +0(5). 



0(4) terms 

This equation can be further simplified. Note that 

adiC3(i?3) - ^a.dK3{H3) = ad/f3(i?3) - a.dK3(H3) 
+ -a,dK3{H3) 



(19.10.15) 



— ad/f3(i?3 — H 3 ) + -&dK 3 {H 3 ) 



= ad/c3(i?3) + -adiC3(i?3) 



(19.10.16) 



Substituting this expression into (19.10.15) gives 



exp (adiC 3 (H)) — H2 + H3 + Hi + adiCa {H3) + -ad;c 3 {H3) +0(5). 

^ 'V ^ 

0(4) terms 



(19.10.17) 
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By Theorem 19.10.3, (19.10.17) is in normal form with respect to H 2 
through terms of order 3. 

We make several remarks concerning this computation. 

1. Normalizing the 0{3) terms modified the 0(4) terms, but did not 
modify the 0{2) terms. 

2. Note that in deriving explicit expressions for the normal form we will 
need to compute Km- We will address this issue in a more explicit 
context in the next subsection. 

3. How might one use a partially normalized Hamiltonian such as 
(19.10.17)? One situation is if the normalized part possesses some 
special solutions or structure, e.g. it possesses periodic orbits it is 
integrable. In that case, by appropriate scaling, one may be able to 
consider the unnormalized part of the Hamiltonian, or “tail” as a 
perturbation to the normalized part. The advantage of this lies in 
the fact that there are a great deal of techniques for studying per- 
turbations of periodic orbits and completely integrable Hamiltonian 
systems. 

19.10b Normal Forms Near Elliptic Fixed Points: 
The Semisimple Case 

Now we specialize to the case that will be our primary interest. Henceforth 
we will assume that 



H2{z,z) = Y,y\^j\^ ^'P2- (19.10.18) 

i=i 

This is the Hamiltonian corresponding to a linear Hamiltonian vector field 
having an elliptic fixed point at the origin, i.e., the eigenvalues of the matrix 
associated with the linear vector field are ±zwi, • • • , yf 0. This 

form assumes that the linearization is semisimple, i.e, it’s complexification 
is diagonalizable, which is true if the LUi, i = 1, • • • , n, are distinct. 

Definition 19.10.6 An elliptic fixed point is said to be resonant if there 
exists a nonzero integer n-vector, i.e., k G Z" — {0}, such that 

n 

(fc, w) = ^ = 0. (19.10.19) 

The order of the resonance is defined to be 

n 

\k\ = 1 ^* 1 - 

2 = 1 
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For fixed ui, the number of independent integer vectors that solve (19.10.19) 
is referred to as the multiplicity of the resonance. If (19.10.19) has no 
integer solutions, then the elliptic fixed point is said to be nonresonant. 

The following abbreviated notation will be used throughout the remain- 
der of this section 



z'^z^ = ■■■ ■■■Zn, ki, k>0,i = !,■■■, n, (19.10.20) 

and 

n 

{k — l,uj) = J2ikj-l,)u>j. (19.10.21) 

1=1 

Next we give a proposition that shows why the method of normalization 
is often referred to as averaging, since we will show that the terms that 
cannot be removed by the normal form transformation are averages over 
trajectories of the Hamiltonian vector field corresponding to H 2 {z,z) = 

Proposition 19.10.7 Let pt denote the flow generated by the Hamiltonian 
vector field z = — Then 



Hm{z,z) 



lim — 
T-s-oo T 




o Pt) {z,z)dt. 



Proof: By definition 19.10.1 we have ad_f/ 2 (Hm) = 0, hence 



dt 



(^Hm ^ Pt^ 



— o Pt — 0, 



(19.10.22) 



which implies that Hm is constant on trajectories generated by the Hamil- 
tonian vector field corresponding to H 2 . Recall also that 

^ {Km o Pt) = ad//2 (^rm(ilm)^ o pt = Hm o Pt. (19.10.23) 

Now we consider the following calculation 



lim — 
T->oo T 



{HmO Pt) {z,z)dt 



= lim — 
T->oo T 




{z,z)dt+ Yiwl 

T—tao 1 



I (//,„■.«)( 



z, z)dt 
(19.10.24) 



From (19.10.22), the first term in (19.10.24) is given by 

rp f (HmOpt]{z,z)dt = Hm{z,z). 
T-s-oo T Jq \ / 



(19.10.25) 
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It remains to show that the second expression in (19.10.24) is zero, i.e., 



lim [ (HmOpt){z,z)dt = 0. 

T— >oo 1 Jq \ / 



(19.10.26) 



From (19.10.23), we have (using 4>q = 1) 



J (^Hm° Pt^ {z,z)dt= ^{KmO Pt - Kjn) ■ (19.10.27) 



Equation (19.10.26) will follow if we show that show that Km ° pT is 
bounded for all T. But this is true since the components of pt{z,z) are 
given by Zj(t) = j = 1, . . . , n. □ 



Real Hamiltonians as Functions of Complex Variables; Some 
Properties 



For explicit calculations we will need more concrete notation. We will ex- 
press our Hamiltonians in the form of a power series 

OO 

H{z,z)= Ckiz'"z\ 

\k\ + \l\=2 



with 

|fc| -|- |Z| = fci -|- ^1 ki, li > 0. 

For real Hamiltonians several relations between the coefficients of the power 
series hold. Moreover, certain algebraic simplifications of terms are possible. 
We now want to catalog these for future reference. 



Ckl = Clk 



This can be seen as follows. For real Hamiltonians we have 



H{z, z) = H{z, z). 



The Hamiltonians that we are now considering have the following 
representation 

OO 

H{z,z)= Y (19.10.28) 

| fe | + | I|=2 

and, hence, their complex conjugates are given by 

OO OO 

H{z,z)= Y Ckiz^z^ = Y^ Ckiz^zK (19.10.29) 

|fc| + |I|=2 |fc| + KI=2 

Reality implies that (19.10.28) and (19.10.29) are equal. Equating the 
coefficients of each term in (19.10.28) and (19.10.29) gives 



Ckl = Clk- 



(19.10.30) 
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If A: = / then is real This is an immediate consequence of (19.10.30). 



Algebraic Simplification, I The following relation will be useful later 
on in simplifying normal forms that we compute. Suppose that the 
Hamiltonian has two terms of the same order having the following 



form 



h -1 

CklZ z 



Cikz’-z’^, 



where 

Cki = a + ib. 

Then this term can be simplified as follows 

CklZ^Z^ + Cikz’-z^, 



= Ckiz^z^ + Ckiz^z^, using (19.10.30), 



= Ckiz’^z^ + Ckiz’^z'-, 



= (a + ib)(Rez'^z^ + Htuz^z’-) 

+ (a — ib){Yiez^z^ — ihaz^z^), 

= 2aRez^z^ — 2Hmz*zb 



(19.10.31) 



In the case where kj ^ Ij, for some 1 < 
j < n with at least one of these two exponents nonzero, this last term 
can be simplified even further: 

2aRez^z^ — 2Hmz*z* 



Algebraic Simplification, II 



= Re [2oRe z’^z'- + z2alm z^z^ + i2bRe z^z^ — 2Hm z^z^] 

= Re [2{a + ib)(Rez^z’' + ilmz^z’-)] 

= 2Re [{a + ib)z'^z‘] . 

(19.10.32) 

Now denote 

a + ib=c, c= |c|e* 



Suppose that in the term 



^k-n z:l: 






there exists exponents such that kj ^ Ij, for some 1 < j < n, with at 
least one of the two nonzero. 





326 



19. Normal Forms 



Then we transform the Zj and Zj terms as follows (and leave the 
remaining 2n — 2 terms unchanged) 

i arg c 

Zj ^ e Zj, 

i arg c 

Zj !->■ e Zj. 

Under this transformation we obtain 

2Re [(a + ib)z'^z'‘] , 

= 2Re , 

= 2|c|Re0'=z'. (19.10.33) 

The Decomposition of Hm 

In relation to our more general notation, it should be clear that 

'y ^ CklZ Z = Urn T^m- 
\ k \ + \ l\=m 



Moreover, Prn is spanned by the homogeneous polynomials z^z’’, \k\ + |l| = 
m. Given Hm, we will want to determine the decomposition Hm = Hm + 
Hm- The following calculation will be be crucial for this. 

adff, {zH^) = [H2 ,zH^] 



— ZkZk, z/ ■■■Z^^Z{ ■■■z^ 



^kn -zW zln 



,fc=l 



= -^•11 I - 



i=i 



dzi 



\ k=l 



SP \ d / ki k„-h 



\k^l 



n 

■2* ' ■ ■ • ) 
- (f u ) 



^kn -^1 



Z ^3 ^ zh 



n 

= -i ^ {iOjkj - uijlj) • • • z^«z[^ ■■■zl^ 
i=i 

= —i{k — l,uj)z^zK 



(19.10.34) 
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Thus, from (19.10.34) we see that z^z^, |fc| + |/| = m, are eigenvectors of 
ad// 2 |Pm) corresponding to the eigenvalues —i{k — l,u)). 

So, for 

Hra{z,z)= Y. (19.10.35) 

|fc| + |i|=m 

we have Hm = Hm + Hm, where 

H^z,z)= Y Ckiz'^^\ (19.10.36) 

|fe| + ti|=m 

{k—l^uj)—0 

and 

H^{z,z)= Y (19.10.37) 

|fe| + |Z|=m 

{k—l^co)^0 

From (19.10.34) we also can conclude the following 

Tm[Hm{z,z)'^ =i Y {k-l,uj)-^Ckiz'^zK (19.10.38) 

|fc| + |Z|=m 
{k—l,cij)^0 

Examples: Normal Forms for Resonant Elliptic Fixed Points 
OF Two Degree-of-Freedom Systems 

We will now compute the leading order terms for several normal forms as- 
sociated with resonant elliptic fixed points of two degree-of-freedom Hamil- 
tonian systems. We will assume that the quadratic part of the Hamiltonian 
has the following form 

H{zi,Zi,Z2,Z2) = ykip + yI^2|^- 

For two degree-of-freedom systems the resonance condition (19.10.19) can 
be stated much more simply: the elliptic fixed point is resonant if ^ is a 
rational number and nonresonant if — is an irrrational number. For the 
former case we use the phrase “wi : W 2 resonance”. 

In the following examples we will compute the terms that cannot be 
removed by the normal form transformation, i.e., the terms in Nm = 
ker (ad //2 |-p^). These terms have the form of (19.10.36), which can be de- 
duced from (19.10.34) (cf. (19.10.20) and (19.10.21)). Hence, to find the 
terms at order m in N^, we proceed as follows. Find a solution to 

(fci — Ii)loi (^2 — ^ 2)^2 = 0, ki, li > 0, (19.10.39) 



which satisfies 



fci + ^2 + ^1 + ^2 = m. 



(19.10.40) 
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and is subject to the constraint 



0 < |/ci — h\, \k2 — h\ < m. 



(19.10.41) 



The existence of such a solution implies that there is an order m term of 
the form 

k2 -h -h 
^1 ^2 ^1 *2 ■ 

All such solutions will give all order m terms. These are the terms that 
cannot be removed by the normalization procedure. 

1:1 Resonance 

To normalize the Hamiltonian at order m, the solutions of the following 
equations play the key role 



k\ — l\ k2 — W — 0 , 
k\ + k2 + h + l2= m, 

0 < |fci - /i|, 1^2 - ^ 2 ! < m, 



(19.10.42) 

(19.10.43) 

(19.10.44) 



Equation (19.10.42) is the 1 : 1 resonance condition and (19.10.43) and 
(19.10.44) are constraints placed on the exponents by the particular order 
of the terms being normalized. There is one immediate conclusion that we 
can make. Combining (19.10.42) and (19.10.43) gives 

ki + k2 = y. 

Hence, the normal form contains no odd order terms. 

We then turn to determining the order 4 terms. In this case (19.10.39), 
(19.10.40), and (19.10.41) become 



k\ — l\ -\- k2 — I 2 = 0, 
k\ -\- k2 l\ -\- 12 — 4, 
0 < |A:i — h\, 1^2 — ^ 2 ! < 4. 



(19.10.45) 

(19.10.46) 

(19.10.47) 



The solutions of (19.10.45) compatible with (19.10.47) are given by 



a) ki — li = 0, k 2 — I 2 = 0, 

b) k\ — l\ = 1 , k2 — I 2 = ~ 1 ) 

b') ki — li = — 1 , k2 — I 2 = 1 ) 

c) ki — li = 2, ^2 ~ ^2 = ~2, 

c) k\ — l\ = —2, k 2 — I 2 = 2, 

d) k\ — l\ = 3 , k2 — I 2 = ~ 3 , 

d') ki — li = — 3 , k2 — I 2 = 3 , 

e) k\ — l\ = 4, k 2 — I 2 = ~4, 

e') ki — h = —4, k 2 — I 2 = 4. 
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Next we take each of these pairs of solutions and combine them with 
(19.10.46). 

a) and (19.10.46) : This yields the solutions 

{ki,k2,h,l2) = a,l,l,l), ( 0 , 2 , 0 , 2 ), ( 2 , 0 , 2 , 0 ), 

which correspond to the terms 



respectively. 

b) and (19.10.46) : This yields the solutions 

(fci,fc2,/i,^2) = (2,0,1, 1), (1,1,0,2), 

which correspond to the terms 

2:^ZiZ2, ^1^2 ^2, 

respectively. 

b’) and (19.10.46) : This yields the solutions 

(fci,fc2,/i,^2) = (l,l,2,0), (0,2,1,1), 
which correspond to the terms 

ZiZ 2 z 1 , Z2Z1Z2, 



respectively. 

Notice that the terms corresponding to the solutions of b) and 
(19.10.46) are the complex conjugates of the terms corresponding to 
the solutions of b’) and (19.10.46). This is generally true of the primed 
and unprimed equations corresponding to the same letters; henceforth 
we will only simultaneously solve the equation corresponding to the 
unprimed letter and (19.10.46). 



c) and (19.10.46) : This yields the solution 

(fci, fc2, /i, I 2 ) = (2, 0,0, 2), 

which corresponds to the term 

2 -2 
^ 1 ^ 2 - 



d) and (19.10.46) : This yields no solutions since we must have k, ki > 0. 
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e) and (19.10.46) : This yields no solutions since we must have k, ki > 0. 

Hence, the normal form through order 4 terms has the form (where we 
have remembered to put in the complex conjugate terms) 

H{zi,Zi,Z 2,Z2) = ^\zi\'^ + i|2:2p 

+ Ciiii|zip|2;2p + C0202 |z2 |^ + C202okl|^ 

+ 2aC201lzfziZ2 + Cii 20 zlziZ 2 

+ 01102^122^2 + 0021121^222 

+ 020022iz| + C02202iz| + 0(6), (19.10.48) 

Using “algebraic simplification, II” given in Section I9.I0b, the normal 
form can be written as 

H{zi,Zi,Z2,Z2) = 2I21P + 

+ 0iiii|zip|22p + Oo 202|22|^ + O202o|2l|^ 

+ 2aReziZiZ2 — 2bhnziziZ2 
+ 2cReziZ2Z2 — 2dhnziZ2Z2 

+ 2eRezf4 ~ + 0(6), (19.10.49) 

where 



02011 =a + ib, 

01102 = c + id, 

C2002 = e + if. (19.10.50) 

Using “algebraic simplication, II” given in 19.10b, the normal form can be 
written as 

H{zi,Zi,Z2,Z2) = \\Zl\^ + 5I22P 

+0llll|2:ip|z2p + Oo022|22|^ + C220o\zi\^ 

+2|c20ii|Re2:(ziZ2 + 2\cii02\ReziZ2Z^ + 2|c2oo2|Rez(z| 

+ 0 ( 6 ). 



I 1:2 Resonance 

Following the same procedure as in the 1 : 1 resonance case, we seek 
nonnegative integer solutions of the following equations 

A:i - /i + 2(^2 - ^2) = 0, (19.10.51) 

Ai + ^2 + ^1 + ^2 = 3, (19.10.52) 

0< |/ci-Zi|, |fc2-;2| <3. (19.10.53) 
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Solutions of (19.10.51) compatible with (19.10.53) are given by 

a) ki — li = 0 , ^2 — /2 = 0 ; 

b) k\ — l\ = 2, k2 — I 2 = 

b') ki — li = — 2 , k2 — I2 = 1 ) 

We next seek simultaneous solutions of each of these pairs equations with 
(19.10.52). 

a) and (19.10.52) : This yields no solutions since we must have h, ki > 0. 

b) and (19.10.52) : This yields the solution 

(fci, ^2, /i, ^ 2 ) = (2, 0, 0, 1), 



which corresponds to the term 

Solving b’) and (19.10.52) simultaneously gives rise to the complex 
conjugate of the term arising from the simultaneous solution of b) 
and (19.10.52). 



Hence, the normal form through order 3 terms is given by (where we 
have remembered to put in the complex conjugate terms) 

H{zi,Zi,Z2,Z2)= 2 l^ll^ + + C2001^1^2 + Coi20^1^2 + C^(4), (19.10.54) 

Using “algebraic simplification, I” given in section 19.10b, the normal form 
can be written as 

H{zi,zi,Z 2 ,Z 2 ) = ]^\zx\^ + \Z 2 \^ + 2aRezlz2-2blu\zlz2 + 0{A), (19.10.55) 



where 



C 2001 = a + ib. 



Using “algebraic simplification, II” given in section 1 9. 1 Ob, the normal form 
can be written as 

H{zi,zi,Z 2 ,Z 2 ) = + |z 2 p + 2|c200i|Re2iZ2 + 0(4), (19.10.56) 



I 1:3 Resonance 

Following the same procedure as in the previous two cases, we find that 
there are no third order terms. The fourth order terms can be obtained 
through solution of the following equations 

ki — li 3(^2 — I 2 ) = 0, 
ki k2 12 = 4, 

0 < |A:i — h\, \k2 — / 2 I < 4. 



(19.10.57) 

(19.10.58) 

(19.10.59) 
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Solutions of (19.10.57) compatible with (19.10.59) are given by 

a) ki — l\ = 0, k 2 — I 2 = 0, 

b) k\ — l\ = 3, k2 — I 2 = 

b'^ k\ — l\ = —3, k2 — I 2 = 1- 



We next solve each of these pairs of equations simultaneously with 
(19.10.58). 

a) and (19.10.52) : This yields the solutions 

(A:i,fc2,^i,^2) = (0,2,0,2), (2, 0,2,0), (1,1,1,1), 

which correspond to the terms 



respectively. 

b) and (19.10.52) : This yields the solutions 

{ki,k2,li,l2) = (3, 0, 0, 1), 

which correspond to the terms 

zfz2. 



Solution of b’) and (19.10.58) yields the complex conjugate of this 
term. 

Hence, the normal form through order 4 terms is given by (where we 
have remembered to put in the complex conjugate terms) 

H{zi,Zi,Z2,Z2) = + ^\Z2\'^ 

+ C0202|^2|^ + C202okl|"* + Cun |zi ^ |Z2 ^ 

+ C3oo 1^1^2 + Coi30^1^2 + C^(5), (19.10.60) 

Using “algebraic simplification, I” given in section 19.10b, the normal form 
can be written as 

H{zi,Zi,Z 2 ,Z 2 ) = + \\Z 2 ? 

+ C0202|^2|^ + C202okl|"* + Cnn |zi ^ |z2 P 
+ 2aRezlz2 - 2blm.zlz2 + 0(5), (19.10.61) 
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where 



C3001 = a + ib. 



Using “algebraic simplification, II” given in section I9.I0b, the normal form 
can be written as 

H{zi,Zi,Z 2,Z2) = ^\zi\'^ + 

+ Co 202|^2|"^ + C202okl|"* + Cim | Zi ^ | Z2 P 
+ 2\csooi\^ezfz2 + 0{5). (19.10.62) 



There is a large literature on the analysis of the dynamics near resonant 
elliptic fixed points in two degree-of-freedom Hamiltonian systems; see, e.g., 
Arnold et al. [1988], Sanders and Verhulst [1985], Henrard [1970], Meyer 
and Hall [1992], and Golubitsky et al. [1995]. 



19.10c The Birkhoff and Gustavson Normal Forms 

We now describe the Birkhoff normal form and the Gustavson normal form. 
These are normal forms in the neighborhood of an elliptic fixed point in 
the nonresonant and resonant cases, respectively. We begin with some def- 
initions. 

First we recall the form of the Hamiltonian in the neighborhood of an 
elliptic fixed point that we described earlier: 

1 " 

H{z, ^) = 2 ^3 (-2:,^) + H^{z,z) H . (19.10.63) 

^ i^l 

Near an elliptic equilibrium point symplectic polar coordinates provide a 
useful coordinate system. These are related to the complex coordinates by 

Zfc = , Zfe = k = l,...,n. 

However, one must be aware that singularities may arise as — >■ 0. Next 

we define the notion of Birkhoff normal form near an elliptic equilibrium 
point. 

Definition 19.10.8 (Birkhoff Normal Form) The Hamiltonian 
(19.10.63) is said to be in Birkhoff normal form of degree r if it is a poly- 
nomial of degree r in Zi, Zi, which is actually a polynomial of degree [|] in 
the variables pi = \ \zi\^ , where [|] denotes the integer part of the number 

r 

2 ■ 

Example 19.10.1. Consider the two degree-of-freedom version of (19.10.63) 
with terms through order 4. Then the Birkhoff normal form of degree 4 has the 
form 

H = LOipi LO 2 P 2 -f apl -1- bpip 2 cp%. 



End of Example 19.10.1 
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Now we can state Birkhoff’s theorem (Birkhoff [1966]). 

Theorem 19.10.9 (Birkhoff) Suppose the uji in (19.10.63) satisfy no 
resonance relations of order less than or equal to r. Then there exists a 
symplectic change of coordinates defined in a neighborhood of the origin 
such that in the new coordinates the Hamiltonian is reduced to Birkhoff 
normal form of degree r up to terms of order r + 1, i.e., 

H^nr{p) + o(\z\, \z\r+\ 



Proof: See Exercise 3. □ 

Next we consider the case where there are some resonances of order less 
than or equal to r. In this case we first define what we mean by a resonant 
normal form near an elliptic equilibrium point. 

Definition 19.10.10 (Resonant Normal Form) Let K he a subgroup 
of the lattice of integers 2” . A resonant normal form of degree r under res- 
onances from K (or, a K resonant normal form of degree r) is a polynomial 
of degree r in Zi, Zi which, when expressed in terms of polar coordinates, 
depends only on the phases through the combinations k ■ (p, with k € K. 

Now we can state Gustavson’s theorem on resonant normal forms (Gus- 
tavson [1966]). 

Theorem 19.10.11 (Gnstavson) Suppose the coi in (19.10.63) satisfy 
no resonance relations of order less than or equal to r, except, possibly, 
for relations k ■ to = 0, k € K. Then there exists a symplectic change of 
coordinates defined in a neighborhood of the origin such that in the new 
coordinates the Hamiltonian is reduced to a K resonant normal form of 
degree r up to terms of order r -|- 1 . 

Proof: See Exercise 4. □ 



19.10b The Lyapunov Subcenter Theorem and 
Moser’s Theorem 

We now describe two classical results that describe the behavior near equi- 
librium points of Hamiltonian systems. 

Gonsider an n degree-of-freedom Hamiltonian having the following form 

H{x,y) = H 2 {x,y) + H^{x,y) + Hi{x,y) H , (19.10.64) 

where 

X = (xi, . . . ,x„) G M”, y = (j/i, ...,yn) G K”, 

H2{x,y) = + 2/i) H 1- + lJn)^ 



and 



(19.10.65) 
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and Hr denotes a homogeneous polynomial of degree r in a; and y. We must 
further assume that H is analytic in a neighborhood of the origin, which 
is an elliptic equilibrium point for the corresponding Hamiltonian vector 
field. 

Theorem 19.10.12 (Lyapunov Subcenter Theorem) If for all s > 1 

the ratio ^ is not an integer, then there exists an invertible, analytic, 
canonical transformation {x,y) i— 1 (C,??) which transforms the Hamiltonian 
H{x,y) to the form 

H{f,y) = Hp)+0{\C\^), (19.10.66) 

where ^ is a function of the single variable p = C = 

(^2 ) • ■ • ) j ^2 ) ■ • ■ ) Pn ) ■ 

Proof: See Siegel and Moser [1971] and Moser [1958]. □ 

From the form of the transformed Hamiltonian one sees immediately that 
^ = 0 is a two dimensional invariant manifold, with the dynamics on this 
manifold given by 



6 = 2^(p)t7i, 
dp 

_ 

Pi = -2^(p)Ci- 

dp 

If one multiplies the first equation by the second by pi, adds them 
together and uses the definition of p one sees immediately that p is a 
constant of the motion on this two dimensional invariant manifold. Since 
p = + ? 7 (, we see that this two dimensional invariant manifold is filled 

with periodic orbits. Moreover, the form of the equations, restricted to 
the periodic orbit p=constant, is that of a linear harmonic oscillator with 
frequency ^(p). 

The Lyapunov subcenter theorem states that under the nonresonance 
assumptions given by the theorem, the corresponding two dimensional in- 
variant subspace of the linearized Hamiltonian vector field filled with peri- 
odic orbits, all having frequency uii, persists for the full nonlinear problem, 
however the frequencies may change slightly. (Keep in mind that this is a 
local result valid in a neighborhood of the equilibrium point.) If the nonres- 
onance condition is violated then the result does not hold in the sense that 
a manifold of periodic solutions does not exist for the nonlinear Hamilto- 
nian vector field (Roels [1971a], [1971b]). Periodic orbits may persist from 
the linearized problem. However, analytic manifolds of such solutions do 
not exist. See Weinstein [1973], Moser [1976], [1978], and Ito [1989]. 

There is one other point to be made. As mentioned in the remarks at 
the end of our discussion of normal forms, generically normal form trans- 
formations are divergent when taken to all orders. However, it may be that 
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the transformation is convergent on a lower dimensional submanifold, as it 
is in the situation described by the Lyapunov subcenter theorem. 

Moser [1958] has generalized the Lyapunov subcenter theorem to the case 
where there is exactly one pair of real eigenvalues, A and —A. We state his 
theorem. 



Theorem 19 . 10.13 (Moser’s Theorem) Suppose H2 in the Hamilto- 
nian (19.10.64) has the form 

H2{x,y) = Xxiyi + y(a;^ + 2/1) + •••+ y(a;^ + 2/^). (19.10.67) 

Then there exists an invertible, analytic, canonical transformation {x, y) 1— >■ 
(^,77) which transforms the Hamiltonian H{x,y) to the form 

H{(,y) = <^>{p)+0{\C\^), (19.10.68) 



where ^ is a function of the single variable p = fipi, and C, = 
(^2 ; ■ ■ • j fn Vn ) ■ 

Proof: See Moser [1958]. □ 

From the form of the transformed Hamiltonian one sees immediately that 
(C = 0 is a two dimensional invariant manifold, with the dynamics on this 
manifold given by 



d<t> 

a = y(d)a, 

dp 

m = 

dp 



If one multiplies the first equation by r/i, the second by ^ 1 , adds them 
together and uses the definition of p one sees immediately that p is a 
constant of the motion on this two dimensional invariant manifold, and 
the trajectories are given by the hyperbolae defined by p=constant. 



19.10E The KAM and Nekhoroshev Theorem’s 
Near an Elliptic Equilibrium Point 

Suppose the Wj in (19.10.63) satisfy no resonance relations of order less 
than or equal to 4. Then, in a neighborhood of the origin, the Hamiltonian 
can be transformed to Birkhoff normal form through degree 4: 

H = Ho(p) +0(5), (19.10.69) 



^o(p) = E 



^iPi 






jPiPj- 



i,j=i 



where 
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The Hamiltonian Hq(p) defines a completely integrable Hamiltonian system 
where the trajectories lie on the invariant tori p = constant. Now for p 
sufficently small one could view the 0(5) terms as a perturbation of Hq{p). 
In this case it is natural to think of trying to apply the KAM theorem to 
see if any of the invariant tori of Ho{p) are preserved when the effect of 
the 0(5) terms is considered. This can be done if i?o(p) is appropriately 
nondegenerate, which we describe in more detail. 

As is typical in KAM type arguments, we need a nondegeneracy assump- 
tion on the unperturbed Hamiltonian. The Hamiltonian Hq(p) is said to 
be nondegenerate if 



det 




7^ 



and isoenergetically nondegenerate if 



det 



/ ddjio Mo 
I dp'^ dp 



0 



7 ^ 0 . 



We now have the following theorem due to Arnold [1963]. 



Theorem 19.10.14 (Arnold) For Hq(p) either nondegenerate or isoen- 
ergetically nondegnerate, in a sujflcently small neighborhood of the origin 
there exists invariant tori for Hq{p) 0(5) having freqencies close to the 
linearized frequencies uii. 



Proof: See Arnold [1963]. □ 

As one might suspect, there is also a version of Nekhoroshev’s theorem 
that is valid in the neighborhood of an elliptic equilbrium point, the oji in 
(19.10.63) satisfy no resonance relations of order less than or equal to 4. In 
addition, we assume that the matrix A = (oij) is sign definite, say positive 
(if not, then multiply the Hamiltonian by —1), with spectrum bounded 
from below by m > 0 and bounded from above by M >0. 



Theorem 19.10.15 (Lochak) Consider a trajectory (z(t),z(t)) of the 
Hamiltonian vector field generated by the Hamiltonian (19.10.69), writ- 
ten in complex coordinates {z,z). There is a constant v > Q such that if 
( 0 ( 0 ), z(0)) is small enough and satisfies 

|^i(0)| > iz II z(0) j = l,...,n, 

where |[ • |[ is the Euclidean norm, then one has 

IUi(f)-^(0)||<^||z(0)f+^, j = l,...,n, 
provided that t satisfies 



|t|<rexp(|| z{0) ir^), 
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where T is some strictly positive constant. 

Proof: See Lochak [1992]. □ 

Polar coordinates are the natural “action-angle” coordinates in the neigh- 
borhood of an elliptic equilibrium point. However, using polar coordinates 
can present difficulties when studying trajectories in a neighborhood of an 
equilibrium point. The reason for this is that a singularity develops when 
a component of the radius vector goes to zero. This problem has been 
overcome in the recent work of Niederman [1998] and Fasso et al. [1998]. 

19.10F Hamiltonian Normal Forms and Symmetries 

In this subsection we will deal with the situation where the Hamiltonian 
which we are transforming to normal form has a symmetry. The main 
result is that the normal form, through any finite order which we wish to 
calculate, will have the same symmetry. All the results in this subsection 
are from Churchill et al. [1983]. 

We recall our earlier notation. Let 



r = 

denote the space of (scalar valued) form power series, and Vr will denote the 
real valued homogeneous polynomials of order r. We assume that H 2 G V 2 
splits V (recall definition 19.10.2). 

The following is our assumption on the nature of the group action. 

Assumption 19.10.1 (The Group Action) Let Q denote a group. We 
assume that it acts linearly (on the right) on V as follows. For any g G Q, 
H, F G V: 

1. Vr-gCVr, 

2. H2-g = H 2 , 

3. {H-g,F-g} = ±{H,F}-g. 



Remarks on the Group Action Assumption: 

Let us now make several remarks intended to related to the group action 
assumption. 

1. Our viewpoint here is that the group elements act on the functions 
F G V , and they do so by ordinary composition of functions, which 
is what stands for. This is consistent with our view of the nor- 
mal form procedure, where we are successively modifying the original 
Hamiltonian. 
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2. Regarding condition 2 above, if F ■ g = F for some g & Q say that 
g fixes F, or that it is invariant with respect to g. The group element 
is said to be a symmetry of F. If F ■ g = F for all 5 G t/ we say that 
Q fixes F, or that it is invariant with respect to the group G, or that 
it is G invariant. 

3. Regarding condition 3 above, the choice of sign requires some expla- 
nation. The plus sign indicates that we are considering a symplectic 
group action, and the minus sign indicates that we are considering a 
reversing group action. In a specific application the appropriate sign 
is chosen and then fixed thereafter. The theory is developed in exactly 
the same way for each case. 

Since we have assumed that H 2 splits V we have 

Fr — Rr 0 ^r, 

where 

Rr = Im ad //2 j-p,, , Nr = Ker adpj |p^ , 

(and, recall, ad //2 (") = where denotes the Poisson 

bracket). Then we have the following lemma. 

Lemma 19.10.16 

1 . Nr and Rr are both G invariant. 

2. E G Nr and F G Rr are both fixed by an element g G G if and only if 
E + F is fixed by g. 

3. Let F[ = adn^ {F) for F G Rr. Then FI G Rr, and H is fixed by g G G 
if and only if F ■ g = ±F. 

4 . Let H ■ g = H and F ■ g = ±F for some g G G. Then adfp (H) is fixed 
by g for all j >0. 

5. Let F ■ g = F for some g G G. Then H = normal form 

through terms of order m with respect to F if and only if H ■ g has 
this property. 

Proof: 

1. Suppose E G Nr, then we have 



{H 2 ,E-g} = {H 2 -g,E-g} = ±{H 2 ,E} ■ g = 0, 



since E G Nr. Hence we have shown that E ■ g G Nr. This establishes 
that Nr is G invariant. 
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Now suppose that F € Rr- Then there exists some K € Rr such 
that ad//2 (K) = F = —{F[2tK}. We use this fact in the following 
calculation 



F-g = -{H2,K}-g = T{H2-9,K-g}, 

= {H2,{TK)-g}, 

— ((T^) ■ 5) G Rr- 

This establishes that Rr is Q invariant. 

2 . This follows from 1 and the uniqueness of the direct sum decomposi- 
tion. 

3 . First, suppose FI & Rr and F[ is fixed by g, i.e., F[ ■ g = FI. Then 

H-g = -{H2,F}-g = T{H2-9,F-g}, 

= -{H2,{±F)-g}. 



Since F[ = F[ ■ g we have 

-{H2,F} = -{H2,{±F)-g}, 

and since a,dH2\Rr is an isomorphism it follows that 

F = ±F-g. 

Now suppose F = ±F ■ g. Using our assumption on the group action 
we have 



H-g = -{H2,F}-g, 

= T{H2 ■ g,F ■ g}, 

= -{H2-g,±F-g}, 
= -{H 2 ,F} = H. 



4 . The proof is by induction. 

j= 0 : By definition, ad^;^ = identity. Hence ad^;^ (H) ■ g = a,d% {FI) 
since FI ■ g = H . 

j=l: For this case we have: 

^d^p{H)-g=-{F,H}-g = {H,F}-g, 

= ±{H-g,F-g}, 

= {H-g,±F-g}, 

= -{±F-g,H-g}, 

= -{F,H} = ad}p {H). 
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Induction Assumption: We assume that the result holds for j = 
n — 1, i.e., adj~^ (H) ■ g = ad^~^ (H). 

j = n: We now show that the result holds for j = n: 
ad^ {H)-g={adF (ad^'W))'!!, 

= -{FMV\H)}-g, 

= T{F-g,&dy\H)-g}, 

= -{±F-g,s.dy\H)-g}, 

= -{F,^<rf\H)}, 

= ad^ {H). 

5. It follows that a F ■ g = F for some g G Q then 

-{F,H,}-g = T{F,H,-g}, j = !,■■■ ■ 

Now suppose that H = ©^ 2 -^r is in normal form with respect to F 
through terms of order m, i.e., adi? (Hr) = 0, r = 2, • • • , m. Then we 
have 



-{F, Hr} ■ g = ©{T’, Hr-g} = iadp {Hr ■ g) = 0, r = 2, • • • , m. 

Hence, H ■ g is in normal form with respect to F through terms of 
order m. 

□ 

Theorem 19.10.17 Let H = G Fr be in normal form with respect 

to H 2 through terms of order (m — 1) > 2, and assume H 2 splits V. Let 

Hm — Hm © Hrm where Hm G Rraf Fra ^ Fm, and SCt Km — L'm{Hm{- 

We assume that Assumption 19.10.1 holds and that g € G fixes H . Then g 
also fixes exp {adx„^) {H). 

Proof: Since H ■ g = H, it follows from part 1 of Lemma 19.10.16 Hm = 
Hm + Hm is fixed by g. Part 2 of the lemma then implies that Hm is 
fixed by g and Hm is fixed by g. It follows from part 3 of the lemma that 
if Fm — }F 2 ^Km}^ where Km — Tm{Fm^j then Km * g — FKm. The 

theorem then follows from part 4 of Lemma 19.10.16. □ 

Summary: 

We summarize the main ideas in this section. Recall that we say that 
F = ©^ 2 '^r admits a symmetry corresponding to g£Q it F-g = H. 

• We have shown that each Hr admits the same symmetry. 

• When H 2 splits V, we have shown that Hr and Hr each admit the 
same symmetry. 
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• When H is converted into normal form with respect to H 2 through 
terms of order (m — 1) > 2, we have shown that the normal form 
through terms of order (m — 1) > 2 will have the same symmetry. 



19.10g Final Remarks 

We want to end this section with some final remarks. 

Convergence. Convergence of the normalizing transformation is dis- 
cussed in Ito [1989], see also the classic work of Siegel [1941]. A good 
review of these issues can be found in Arnold et al. [1988]. The fol- 
lowing results are known. 

1. If (19.10.63) is completely integrable, then the transforma- 
tion to Birkhoff normal form is a convergent transformation 
((Riissmann [1964]). 

2. For ’’most” Hamiltonians of the form (19.10.63), the transfor- 
mation to Birkhoff normal form is a divergent transformation 
(Siegel [1954]). 

See also Bryuno [1988], [1989a], [1989b]. 

Stability of Elliptic Equilbria. In Chapter 14 we saw that if the second 
derivative of the Hamiltonian at the elliptic equilbrium point was sign 
definite, then the equlibrium point is Liapunov stable. In the case that 
it is not sign definite (e.g., some frequencies have opposite signs), then 
for the two-degree-of-freedom case, the KAM theorem can be applied 
to study stability, since, in that case, KAM tori are codimension one 
in the energy surface and therefore bound trajectories, see Arnold 
[1961] and Meyer and Schmidt [1986]. A good general reference for 
results on this subject is Bryuno [1989c]. 

Invariants. As we saw in our development of the approach of Elphick et 
al. [1987] to general normal forms, the normal form can be expressed 
as polynomials that are invariant under some group action. Similar 
results hold for the Hamiltonian case. For general background see 
Golubitsky et al. [1988]. For examples specific to the Hamiltonian 
case see Kirk et al. [1996], Hoveijn [1992]. 
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1. Show that for F G Fs, £ Fr, 



adfriHr) G Fr+s-2- 

Using this relation, inductively verify that 

ad^p{Hr-) G Fr+j(s-2)- 
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2. Consider a two degree-of-freedom Hamiltonian in the neighborhood of an elliptic equi- 
librium. Prove that the normal form, through terms of order r, is completely integrable. 
Does the same argument apply to Hamiltonian systems with three or more degrees-of- 
freedom? 

3. Consider the Hamiltonian 



H + -f/3 + • • • + + • • • , HrGVr. 



where 




Suppose that the Hamiltonian is transformed to normal form through order m terms. 
Prove that the normal form through order m terms is invariant with respect to the flow 
generated by linear Hamiltonian vector field with Hamiltonian given by H 2 - Describe 
the group action defined by this flow? Can you relate this to the averaging result 
described in Proposition 19.10.7? 

4. Recall the definition of a Lie group action on a vector space V given in Definition 19.7.3. 
Let V — be a symplectic vector space with respect to the canonical symplectic form 
(or, in some instances we will consider as a symplectic vector space with respect 
to the complex version of the canonical symplectic form). The Lie group action is said 
to be symplectic if the map 



Py 



V ^ ■ V = p~y{v), 



is symplectic for each 7 G P. Consider the following group actions. 
Z2 acting on R^: 



{x, y) ^ {-X, -y). 



SO{2) acting on C: 



9 



■ z — e 



±e 



z. 



Zyrt acting on C: 



(p ■ z — z, z 



27T 

m 



acting on C^: 



& ■ (zi, Z2) — e^”'^22), m, n G . 



Are these group actions symplectic? 

5. Consider the symplectic group action described in Assumption 19.10.1. Relate this to 
the notion of a symplectic group action described in the previous exercise. 

6. Consider one degree-of-freedom Hamiltonians near an equilibrium point where the 
leading order terms in the Hamiltonian are quadratic and given by 

H2 = + 9^). > 0, 

H 2 — A > 0. 



Compute the normal forms in each case. Can you say anything about convergence of 
the normal form? 
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7. Consider two degree-of-freedom Hamiltonians near an equilibrium point where the 
leading order terms in the Hamiltonian are quadratic and given by 

H 2 = \piqi + ^(P 2 + <? 2 )i A, w > 0, 

H 2 = ^iPlQl + ^2P2Q2, ^1, ^2 > 0. 



Compute the normal forms in each case. For the second Hamiltonian consider both 

the cases rational and irrational. 

^2 -^2 

8 . Consider three degree-of-freedom Hamiltonians near an equilibrium point where the 
leading order terms in the Hamiltonian are quadratic and given by 

H2 = Apigi + + 92) + + 93) “2, 1^3 > 0 , 

H2 — Aipigi -1- X 2 P 2 <i 2 + "^(7*3 ^3)’ ^1’ ^2 > 0. 



Compute the normal forms in each case. For the first Hamiltonian consider both the 
cases — rational and — irrational. For the second Hamiltonian consider both the 

UJ2 

cases rational and irrational. 

A2 A2 

9. Consider one degree-of-freedom Hamiltonians defined on (or C. Compute the normal 
forms (through order 4) for Hamiltonians that are invariant with respect to the Zm 
(m = 2,3,4) and SO(2) actions described above. 

10. Prove Theorem 19.10.9. 

11. Prove Theorem 19.10.11. 

12. Consider a two-degree-of-freedom Hamiltonian system near an elliptic equilibrium 
point where the lowest order (quadratic) part of the Hamiltonian is given by 

H2{zi,Zi,Z2,Z2) = + ^|Z2|^, 1^1 . > 0. 



The “Hopf variables”, are defined as; 



Wi =2 Re {z';’^ z-2‘^1) , 
1^2 = 2 Im (zPz- 2 ‘^ 1 ) , 
Ws = iJl\zi\'^ - LJ2\z:2\^, 
W 4 = U}i\zi\'^ + UJ 2 \Z 2 \'^ . 



1 

4 



+ W^) 



/ IT4 + VF3 \ ‘^2 / IF4 - IF3 \ 

V 2o;i ) V 2cj2 / 



(a) Show that if cJi and u )2 are resonant, ad/f 2 (^) = 0 if and only if H can be 
written as a polynomial in the Hopf variables. 

(b) Show that if u)i and 102 are not resonant, adn 2 (^) = 0 if and only if H can be 
written as a polynomial in the Hopf variables VF 3 and VF 4 , i.e., as a polynomial 
in | 2 i|^ and \z 2 \‘^ ■ 

(c) Show that in the resonant case the normal form is invariant with respect to 
and in the nonresonant case it is invariant with respect to T^. 



13. Show that the truncated normal form associated with the Hamiltonian near an elliptic 
equilibrium point is completely integrable if the elliptic equilibrium point is resonant 
of multiplicity one. 




19.12 Conjugacies and Equivalences of Vector Fields 345 



14. Consider the 1:2:2 resonant elliptic equilibrium point of a three degre-of-freedom 
Hamiltonian system. 

(a) Show that this is a multiplicity two resonance. 

(b) Show that the normal form, through terms of order three, has the following form 



H = -|zi|^ + |Z2|^ + |Z 3 |^ + - Rezjz2, o e R. 

(c) Show that this normal form defines a completely integrable Hamiltonian system. 
This result is due to R. Cushman. For more information on this problem see 
Haller and Wiggins [1996]. 

15. Develop the Elphick-Tirapegui-Brachet-Coullet-Iooss normal form approach for Hamil- 
tonian normal forms. 

16. For both the Lyapunov subcenter theorem and Moser’s theorem derive conditions under 
which the orbits on the invariant two-dimensional manifold have different energies. In 
this case we say that they are “energetically isolated”. 

17. Using Hamiltonian normal form theory, for H 2 given by (19.10.65), show that the 
Hamiltonian (19.10.64) can be formally transformed to the form (19.10.66). 

18. Using Hamiltonian normal form theory, for H 2 given by (19.10.67), show that the 
Hamiltonian (19.10.64) can be formally transformed to the form (19.10.68). 

19. Can Moser’s theorem be generalized to the case of more than one pair of real eigen- 
values? 



19.12 Conjugacies and Equivalences of Vector 
Fields 

In Chapter 11 we discussed the idea of a C’’ conjugacy or coordinate trans- 
formation for maps. This was motivated by the question of how the dy- 
namics of a Poincare map were affected as the cross-section on which the 
map was defined was changed. The change in cross-section could be viewed 
as a change of coordinates for the map. A similar question arises here in 
the context of the method of normal forms; namely, the normal form of a 
vector field or map is obtained through a coordinate transformation. How 
does this coordinate transformation modify the dynamics? For maps, the 
discussion in Chapter 11 applies; however, we did not discuss the notion of 
conjugacies of vector fields. To address that topic, we begin with a defini- 
tion. 

Let 



x = f{x), x G K", (19.12.1) 

y = g{y), y G K", (19.12.2) 



be two C’’ (r > 1) vector fields defined on R" (or sufficiently large open 
sets ofM"). 
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Definition 19.12.1 The dynamics generated by the vector fields f and g 
are said to be equivalent (fc < r) if there exists a diffeomorphism h 
which takes orbits of the flow generated by f, 4>{t,x), to orbits of the flow 
generated by g, ip{t,y), preserving orientation but not necessarily param- 
eterization by time. If h does preserve parameterization by time, then the 
dynamics generated by f and g are said to be conjugate. 

We remark that, as for maps, the conjugacies do not need to be defined 
on all of M" but, rather, on appropriately chosen open sets in R". In this 
case / and g are said to be locally C* equivalent or locally C* conjugate. 





FIGURE 19.12.1. (a) Phase portrait of (2.2.124). (b) Phase portrait of (2.2.125). 



Definition 19.12.1 is slightly different than the analogous definition for 
maps (see Chapter 11). Indeed, in Definition 19.12.1 we have introduced 
an additional concept: the idea of equivalence. These differences all 
stem from the fact that in vector fields the independent variable (time) is 
continuous and in maps time is discrete. Let us illustrate these differences 
with an example. However, we exhort the reader to keep the following in 
mind. 

The purpose of Definition 19.12.1 is to provide us with a way of charac- 
terizing when two vector fields have qualitatively the same dynamics. 

Example 19.12.1. Consider the two vector fields 



Xl = X2, 
X2 = -Xl, 



(X1,X2) € 



(19.12.3) 



yi = V2, 

V 2 = -yi - yl, 



and 



{yi,y2) e R^. 



(19.12.4) 
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The phase portraits of each vector field are shown in Figure 21.1.1. Each vector 
field possesses only a single fixed point, a center at the origin. In each case the 
fixed point is surrounded by a one-parameter family of periodic orbits. Note also 
from Figure 21.1.1 that in each case the direction of motion along the periodic 
orbits is in the same sense; therefore, qualitatively these two vector fields have 
the same dynamics — the phase portraits appear identical. However, let us see if 
this is reflected by the idea of conjugacy. 

Let us denote the flow generated by (19.12.3) as 

X={X1,X2), 

and the flow generated by (19.12.4) as 

^(Ly), y = (yi,y2)- 

Suppose we have found a diffeomorphism, h, taking orbits of the flow gener- 
ated by (19.12.3) into orbits of the flow generated by (19.12.4). Then we have 

ho <f>[t,x) = 'tp(^t,h{x)^. (19.12.5) 

Equation (19.12.5) reveals an immediate problem; namely, if 4>{t,x) and 'ip{t,y) 
are periodic in t, then h o cl){t,x) and h(®)^ must have the same period 
in order for (19.12.5) to hold. However, in general, (19.12.5) cannot be satisfied. 
Consider (19.12.3) and (19.12.4). The vector field (19.12.3) is linear; therefore, all 
the periodic orbits have the same period. The vector field (19.12.4) is nonlinear; 
therefore, the period of the periodic orbits varies with the distance from the fixed 
point. Thus, (19.12.3) and (19.12.4) are not conjugate (note; one can actually 
compute the period of the orbits of (19.12.4) since it is a Hamiltonian vector field. 
It is a tedious exercise involving elliptic integrals). 



End of Example 19.12.1 

It is precisely this situation from which the idea of C* equivalence can 
rescue us, for rather than only having a diffeomorphism that maps 
orbits to orbits, we at the same time allow a reparametrization of time 
along the orbit. We make this idea more quantitative as follows. Let a{x, t) 
be an increasing function of t along orbits (note: it must be increasing in 
order to preserve orientations of orbits). Then (19.12.3) and (19.12.4) are 
equivalent if the following holds 

h o x) = ip(a{x, t), h{x)) . (19.12.6) 

Equation (19.12.6) shows that orbits of the flow generated by (19.12.3) are 
mapped to orbits of the flow generated by (19.12.4); however, the time 
dependence of the image of an orbit under h may be reparametrized in an 
orbitally dependent manner. Finally, we remark that the term “preserving 
orientation” in Definition 19.12.1 refers to the fact that the direction of 
motion along an orbit is unchanged under C* equivalence. 

Let us now consider some of the dynamical consequences of Definition 
19.12.1. 
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Proposition 19.12.2 Suppose f and g are conjugate. Then 

i) fixed points of f are mapped to fixed points of g; 

ii) T -periodic orbits of f map to T -periodic orbits of g. 

Proof: That / and g are conjugate under h implies the following 

ho 4>{t^x) = ip{t,h{x)), (19.12.7) 

Dh^ = f). (19.12.8) 

The proof of i) follows from (19.12.8) and the proof of ii) follows from 

(19.12.7) . □ 

Proposition 19.12.3 Suppose f and g are conjugate {k > 1) and 
f{xo) = 0; then Df{xo) has the same eigenvalues as Dg(^h{xo)) ■ 

Proof: We have the two vector fields x = f{x), y = g{y). By differentiating 

(19.12.7) with respect to t we have 

Dh\^f{x)=g{h{x)). (19.12.9) 



Differentiating (19.12.9) gives 



D^h\j{x) + DhlDfl= Dh^ . 


(19.12.10) 


Evaluating (19.12.10) at xq gives 




Dhl, Dfl, = ^^1.0 


(19.12.11) 


or 






(19.12.12) 



and, since similar matrices have equal eigenvalues, the proof is complete. 
□ 



The previous two propositions dealt with C* conjugacies. We next ex- 
amine the consequences of equivalence under the assumption that the 
change in parameterization by time along orbits is C^. The validity of this 
assumption must be verified in any specific application. 

Proposition 19.12.4 Suppose f and g are equivalent; then 

i) fixed points of f are mapped to fixed points of g; 

ii) periodic orbits of f are mapped to periodic orbits of g, but the periods 
need not be equal. 
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Proof: If / and g are equivalent, then 



h o (j)(t, x) = if(a{x, t), h{x)) , (19.12.13) 

where a is an increasing function of time along orbits (note: a must be 
increasing in order to preserve orientations of orbits). 

Differentiating (19.12.13) gives 

(19.12.14) 

at oa 

Therefore, (19.12.14) implies i), and ii) follows automatically since dif- 
feomorphisms map closed curves to closed curves. (If this were not true, 
then the inverse would not be continuous.) □ 

Proposition 19.12.5 Suppose f and g are C* equivalent {k > 1) 
and f{xo) = 0; then the eigenvalues of Df{xo) and the eigenvalues of 
Dg(^h{xo)) differ by a positive multiplicative constant. 

Proof: Proceeding as in the proof of Proposition 19.12.3, we have 

(^(y 

Dh\,^f{x) = —g{h{x)). (19.12.15) 

Differentiating (19.12.15) gives 



D^h\^f{x) + Dh\^ Df\x 



Evaluating at Xq gives 



^9\h{x) + 



d'^a 

dxdt 



g{h{x)). 

(19.12.16) 



Dhlo ^9\h(xo) ; 



(19.12.17) 



thus, and are similar up to the multiplicative constant 

dafdt, which is positive, since a is increasing on orbits. □ 

Example 19.12.2. Consider the vector fields 



xi = Xl, 
X2 = X2, 



{xi,X2) € 



(yi,j/2) e 



and 

yi = yi, 

V2 = 2y2, 

Qualitatively these two vector fields have the same dynamics. However, by Propo- 
sition 19.12.3 they are not equivalent, fe > 1. 



End of Example 19.12.2 
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19.12a An Application: The Hartman- Grobm an 
Theorem 

An underlying theme throughout the first chapter of this book was that 
the orbit structure near a hyperbolic fixed point was qualitatively the same 
as the orbit structure given by the associated linearized dynamical system. 
A theorem proved independently by Hartman [1960] and Grobman [1959] 
makes this precise. We will describe the situation for vector fields. 
Consider a O'" (r > 1) vector field 

x = f{x), xGM", (19.12.18) 

where / is defined on a sufficiently large open set of K”. Suppose that 
(19.12.18) has a hyperbolic fixed point at a; = xq, i.e., 

f(xo) = 0 , 

and Df{xo) has no eigenvalues on the imaginary axis. Consider the asso- 
ciated linear vector field 



i = Df{xo)^, CgR”- (19.12.19) 

Then we have the following theorem. 

Theorem 19.12.6 (Hartman and Grobman) The flow generated by 
(19.12.18) is C° conjugate to the flow generated by (19.12.19) in a neigh- 
borhood of the fixed point x = xq. 

Proof: See Arnold [1973] or Palis and deMelo [1982]. □ 

We remark that the theorem can be modified so that it applies to hy- 
perbolic fixed points of maps, and we leave it to the reader to reformulate 
the theorem along these lines. 

A point to note concerning Theorem 19.12.6 is that the conjugacy trans- 
forming the nonlinear flow into the linear flow near the hyperbolic fixed 
point is not differentiable; rather, it is a homeomorphism. This makes the 
generation of the transformation via, for example, normal form theory, not 
possible since the coordinate transformations constructed via that theory 
were power series expansions and, hence, differentiable. However, a closer 
look at normal form theory will reveal the heart of the problem with “dif- 
ferentiable linearization.” Let us expand on this with a brief discussion. 
Recall equation (19.1.8) 

x = Jx + F 2 {x) + --- + Fr-i{x) + 0{\x\''), xGK". (19.12.20) 

A sufficient condition for eliminating the 0{\x\^) terms (2 < k < r— 1) 
from (19.12.20) is that the linear operator is invertible on Hk. We 

want to explore why is noninvertible. 
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Recall 

{hk{x)) = Jhk{x) — Dhk{x)Jx, (19.12.21) 

with hk{x) G Hk, where Hk is the linear vector space of vector- valued 

homogeneous polynomials of degree k. Let us choose a basis for Hk- Suppose 
J is diagonal with eigenvalues Ai, • • • , A„ (note: if J is not diagonalizable, 
then the following argument is still valid, but with slight modifications; see 
Arnold [1983] or Bryuno [1989a]). Let Ci, 1 < i < n, be the standard basis 
of M”, i.e., 6i is an n vector with a 1 in the i**' component and zeros in the 
remaining components. Then we have 

Jci = XiCi. (19.12.22) 

As a basis for Hk we take the set of elements 

n 

^^nij = k, mj>0, (19.12.23) 

1=1 

where we consider all possible terms x™^ ■ ■ ■ a;™" of degree k multiplying 
each 6i, 1 < i < n. 

Next we consider the action of L^P\-) on each of these basis elements of 
Hk- Let 

n 

hk{x) = x^^ ■ ■ ■ x’^'^Ci, '^^rrij = k, mj > 0; (19.12.24) 

1=1 

then a simple calculation shows that 

n 

lP {hk{x)) = Jhk{x) — Dhk{x)Jx = — hk{x). (19.12.25) 

Thus, the linear operator L^P{-) is diagonal in this basis, with eigenvalues 
given by 

n 

Xi-'^rrijXj. (19.12.26) 

1=1 

Now we can see the problem. The linear operator L^j\-) will fail to be 
invertible if it has a zero eigenvalue, which in this case means 

n 

Xi = ^mjXj. (19.12.27) 

1=1 

Equation (19.12.27) is called a resonance and is the origin of the name 
“resonance terms” for the unremovable nonlinear terms in the normal form 
described in Theorem 19.1.1. The integer 

n 

1=1 
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is called the order of the resonance. Thus, the difficulty in finding a differ- 
entiable coordinate change that will linearize a vector field in the neighbor- 
hood of a hyperbolic fixed point lies in the fact that an eigenvalue of the 
linearized part may be equal to a linear combination over the nonnegative 
integers of elements from the set of eigenvalues of the linearized part. Much 
work has been done on the geometry of the resonances in the complex plane 
and on differentiable linearizations in the situations where the resonances 
are avoided. For more information we refer the reader to the fundamen- 
tal papers by Sternberg [1957], [1958] and also Arnold [1983] and Bryuno 
[1989a]. 

Let us consider the following example due to Sternberg (see also Meyer 
[1986]). Consider the vector field 

X = 2x + y'^, 

y = y, (cr, 2 /)GM". (19.12.28) 

This vector field clearly has a hyperbolic fixed point at the origin. The 
vector field linearized about the origin is given by 



X = 2x, 

y = y. (19.12.29) 

Eliminating t as the independent variable, (19.12.29) can be written as 

^ y^O. ( 19 , 12 . 30 ) 

ay y 

Solving (19.12.30), the orbits of (19.12.29) are given by 

x = cy^, (19.12.31) 

where c is a constant. Clearly (19.12.31) are analytic curves at the origin. 

Now consider the nonlinear vector field (19.12.28). Eliminating t as the 
independent variable, (19.12.28) can be written as 

^ = - + y, y^o- (19.12.32) 

ay y 

Equation (19.12.32) is a standard first-order linear equation which can be 
solved via elementary methods (see, e.g., Boyce and DiPrima [1977]). The 
solution of (19.12.32) is given by 

x = y^[k + log\y\], (19.12.33) 

where fc is a constant. Clearly (19.12.33) are but not at the origin. 
Since the property of lying on curves must be preserved under a 
change of coordinates (the chain rule), we conclude that (19.12.28) and 
(19.12.29) are but not conjugate. 
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In terms of resonances, the problem involves a second-order resonance 
(hence the problem with linearization). This can be seen as follows. Let 

Ai = 1, 

A2 = 2, 

be the eigenvalues of the linearization. Then we have 

A2 = TOiAi -I- 1112X2 , 

with mi = 2, 1712 = 0, and nij = 2. 

19 . 12 b An Application: Dynamics Near a Fixed 
POINT-SOSITAISVILl’S THEOREM 

Consider the parameter-dependent vector field 
X = Ax + f{x,y,z,e), 

y = By + g{x, y, z,e), {x, y, z, e) G x x M“ x W, 

z = Cz + h{x,y, z,e), (19.12.34) 

where 

/( 0 , 0 , 0 , 0 ) = 0 , 12 /( 0 , 0 , 0 , 0 )= 0 , 

g{0, 0, 0, 0) = 0, Dg{0, 0, 0, 0) = 0, 

/i(0,0,0,0) = 0, L»h(0,0,0,0) = 0, 

and /, g, and h are C’’ (r > 2) in some neighborhood of the orgin, A is a 
c X c matrix having eigenvalues with zero real parts, i? is a s x s matrix 
having eigenvalues with negative real parts, and C is a, ux u matrix having 
eigenvalues with positive real parts. 

The center manifold theorem tells us that near the orgin in x x x 
the flow generated by (19.12.34) is C° conjugate to the flow generated 
by the following vector held 

X = w{x, e), 

y = —y, {x, y, z, e) G x M® x M" x R^, 

i = 0, (19.12.35) 

where w{x, e) represents the C’’ vector held on the center manifold. The 
result in this form is due to Sositaisvili [1975]. 

19.13 Final Remarks on Normal Forms 

Nonuniqueness of Normal Forms. It should be clear from our discus- 
sion that normal forms need not be unique. However, it may happen 
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that certain properties of a vector field (e.g., symmetries) must be 
possessed by any normal form. Such questions are explored in Kum- 
mer [1971], Bryuno [1989a], van der Meer [1985], Baider and Churchill 
[1988], and Baider [1989]. 

Divergence of the Normalizing Transformations. In general, nor- 
mal forms are divergent. This is discussed in detail in Siegel [1941] 
and Bryuno [1989a]. However, this does not affect considerations of 
local stability. 

Computation of Normal Forms. The book of Rand and Armbruster 
[1987] describes how one may implement the computation of normal 
forms using computer algebra. Chow et al. [1990] provide an algo- 
rithm for computing normal forms, as well as a MACSYMA code. 
Ashkenazi et al. [1992] provide the same for Hamiltonian normal 
forms. 

The Method of Amplitude Expansions. This method has been used 
for many years in studies of hydrodynamic stability and has much in 
common with the center manifold reduction. This situation has been 
clarified by Coullet and Spiegel [1983]. 

The Homological Equation. We want to remark on some terminology. 
In computing the normal form of vector fields near a fixed point, the 
equation 

Dhk{y)Jy - Jhk{y) = Fk{y) 

must be solved in order to simplify the order k terms in the Taylor 
expansion of the vector field. This equation is called the homological 
equation (see Arnold [1983]). The analogous equation for maps is also 
called the homological equation. 

Nonautonomous Systems. Consider the situation of C’’ (with r as large 
as necessary) vector fields 

X = f{x), X G K". (19.13.1) 

The method of normal forms as developed in this chapter can be 
viewed as a method for simplifying the vector field in the neighbor- 
hood of a fixed point. However, suppose x{t) = x{t) is a trajectory 
of this vector field. Can the method of normal forms then be used 
to simplify the vector field in the neighborhood of a general (time- 
dependent) solution? The answer is “sometimes,” but there are asso- 
ciated difficulties. Let 

X = x{t) + y; 

then (19.13.1) becomes 



y = A{t)y + 0{\y\^), 



(19.13.2) 
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where 

A{t) = Df{x{t))- 

In applying the method of normal forms to (19.13.2), the fact that 
A{t) is time dependent causes problems. If x{t) is periodic in t, then 
A{t) is periodic. Hence, Floquet theory can be used to transform 
(19.13.2) to a vector field where the linear part is constant (this is 
described in Arnold [1983]). In this case the method of normal forms 
as developed in this chapter can then be applied. Recently, Floquet 
theory has been generalized to the quasiperiodic case by Johnson 
[1986, 1987]; using these ideas, the normal form theory can be ap- 
plied in this case also. Recent results along these lines have also been 
obtained by Jorba and Simo [1992], Treshchev [1995], and Jorba et 
al [1997]. Their results also apply to linear systems with quasiperiod- 
ically varying coefficients, and are computationally very efficient. 

Concerning center manifold theory. Sell [1978] has proved existence 
theorems for stable, unstable, and center manifolds in nonautonomous 
systems. 

Some very interesting work of Siegmund [2002] has recently appeared 
that develops the method of normal forms for very general nonau- 
tonomous vector fields. 

Smooth Linearization. There exists a number of results concerning dif- 
ferentiable coordinate changes that linearize a dynamical system (vec- 
tor field or diffeomorphism) in the neighborhood of an invariant man- 
ifold. A recent review of these results can be found in Bronstein and 
Kopanskii [1994]. 

Real Normal Forms and Complex Coordinates. We have seen nu- 
merous examples in this chapter where the use of complex coordi- 
nates simplifies normal form calculations. A systematic development 
of this approach can be found in Menck [1993]. 

Normal Forms for Stochastic Systems. Normal form theory for sto- 
chastic dynamical systems has been worked out in Namachchivaya 
and Leng [1990] and Namachchivaya and Lin [1991]. 
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Bifurcation of Fixed Points of 
Vector Fields 



Consider the parameterized vector field 

y = g{yA), 2/gR”, Agm^, (20.0.1) 

where 5 is a C’’ function on some open set in K" x The degree of 
differentiability will be determined by our need to Taylor expand (20.0.1). 
Usually will be sufficient. 

Suppose (20.0.1) has a fixed point at (y, A) = (yoi Aq), i.e., 

ff(yo,Ao) = 0. (20.0.2) 

Two questions immediately arise. 

1. Is the fixed point stable or unstable? 

2. How is the stability or instability affected as A is varied? 

To answer Question 1, the first step to take is to examine the linear vector 
field obtained by linearizing (20.0.1) about the fixed point {y, A) = (yo, Aq). 
This linear vector field is given by 

i = Dyg{yo,\o)£n ?GK". (20.0.3) 

If the fixed point is hyperbolic (i.e., none of the eigenvalues of Dyg{yo, Aq) 
lie on the imaginary axis), we know that the stability of {yo, Aq) in (20.0.1) 
is determined by the linear equation (20.0.3) (cf. Chapter 1). This also 
enables us to answer Question 2, because since hyperbolic fixed points are 
structurally stable (cf. Chapter 12), varying A slightly does not change the 
nature of the stability of the fixed point. This should be clear intuitively, 
but let us belabor the point slightly. 

We know that 

g(yo,Ao) = 0, (20.0.4) 

and that 

Dyg{yo,Xo) (20.0.5) 

has no eigenvalues on the imaginary axis. Therefore, Dyg{yo, Xq) is in- 
vertible. By the implicit function theorem, there thus exists a unique C’’ 
function, y{X), such that 



g{y{x),x) = 0 



( 20 . 0 . 6 ) 




20.1 A Zero Eigenvalue 357 



for A sufficiently close to Aq with 



J/(Ao) = yo. (20.0.7) 

Now, by continuity of the eigenvalues with respect to parameters, for A 
sufficiently close to Aq, 

Dyg{y{X),X) (20.0.8) 

has no eigenvalues on the imaginary axis. Therefore, for A sufficiently close 
to Aq, the hyperbolic fixed point {yo, Aq) of (20.0.1) persists and its stability 
type remains unchanged. To summarize, in a neighborhood of Aq an isolated 
fixed point of (20.0.1) persists and always has the same stability type. 

The real fun starts when the fixed point {yo, Aq) of (20.0.1) is not hyper- 
bolic, i.e., when Dyg{yo, Xq) has some eigenvalues on the imaginary axis. 
In this case, for A very close to Aq (and for y close to yo), radically new 
dynamical behavior can occur. For example, fixed points can be created or 
destroyed and time-dependent behavior such as periodic, quasiperiodic, or 
even chaotic dynamics can be created. In a certain sense (to be clarified 
later), the more eigenvalues on the imaginary axis, the more exotic the 
dynamics will be. 

We will begin our study by considering the simplest way in which 
Dyg{yo,Xo) can be nonhyperbolic. This is the case where Dyg{yo,Xo) has 
a single zero eigenvalue with the remaining eigenvalues having nonzero real 
parts. The question we ask in this situation is what is the nature of this 
nonhyperbolic fixed point for A close to Aq? It is under these circumstances 
where the real power of the center manifold theory becomes apparent, since 
we know that this question can be answered by studying the vector field 
(20.0.1) restricted to the associated center manifold (cf. Section 18.2). In 
this case the vector field on the center manifold will be a p-parameter fam- 
ily of one-dimensional vector fields. This represents a vast simplification of 
( 20 . 0 . 1 ). 

20.1 A Zero Eigenvalue 

Suppose that Dyg{yo, Xq) has a single zero eigenvalue with the remaining 
eigenvalues having nonzero real parts; then the orbit structure near {yo, Aq) 
is determined by the associated center manifold equation, which we write 
as 

x = f{x,y), X G fxGW’, (20.1.1) 

where g, = X — Aq. Furthermore, we know that (20.1.1) must satisfy 

( 20 . 1 . 2 ) 



/( 0 , 0 )= 0 , 

|( 0 , 0 )= 0 , 



(20.1.3) 
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Equation (20.1.2) is simply the fixed point condition and (20.1.3) is the 
zero eigenvalue condition. We remark that (20.1.1) is C’’ if (20.0.1) is C’’. 
Let us begin by studying a few specific examples. In these examples we will 
assume 

^ 

If there are more parameters in the problem (i.e., /i G p > 1), we 
will consider all, except one, as fixed. Later we will consider more carefully 
the role played by the number of parameters in the problem. We remark 
also that we have not yet precisely defined what we mean by the term 
“bifurcation.” We will consider this after the following series of examples. 



20.1a Examples 



Example 20.1.1. Consider the vector field 

X = f{x, p) = p, — , a; € /r G (20.1.4) 



It is easy to verify that 



/( 0 , 0 ) = 0 



(20.1.5) 



and 

|^( 0 , 0 ) = 0 , ( 20 . 1 . 6 ) 

but in this example we can determine much more. The set of all fixed points of 
(20.1.4) is given by 

2 r» 

— X =0 



or 

p = x'^. (20.1.7) 

This represents a parabola in the p — x plane as shown in Figure 20.1.1. 




FIGURE 20.1.1. 



In the figure the arrows along the vertical lines represent the flow generated 
by (20.1.4) along the ^-direction. Thus, for p < 0, (20.1.4) has no fixed points, 
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and the vector field is decreasing in x. For > 0, (20.1.4) has two fixed points. 
A simple linear stability analysis shows that one of the fixed points is stable 
(represented by the solid branch of the parabola), and the other fixed point is 
unstable (represented by the broken branch of the parabola). However, we hope 
that it is obvious to the reader that, given a C’’ (r > 1) vector field on having 
only two hyperbolic fixed points, one must be stable and the other unstable. 

This is an example of bifurcation. We refer to {x, fi) = (0, 0) as a bifurcation 
point and the parameter value /r = 0 as a bifurcation value. 

Figure 20.1.1 is referred to as a bifurcation diagram. This particular type of 
bifurcation (i.e., where on one side of a parameter value there are no fixed points 
and on the other side there are two fixed points) is referred to as a saddle- 
node bifurcation. Later on we will worry about seeking precise conditions on 
the vector field on the center manifold that define the saddle-node bifurcation 
unambiguously. 



End of Example 20.1.1 



Example 20.1.2. Consider the vector field 



X = f{x, p.) = px — x^ , xGR^, p G . 



It is easy to verify that 
and 



/( 0 , 0 ) = 0 

df{o,o) 



dx 



= 0 . 



Moreover, the fixed points of (20.1.8) are given by 

a; = 0 

and 

X = p 



( 20 . 1 . 8 ) 

(20.1.9) 

( 20 . 1 . 10 ) 



( 20 . 1 . 11 ) 

( 20 . 1 . 12 ) 




FIGURE 20.1.2. 
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and are plotted in Figure 20.1.2. Hence, for p < 0, there are two fixed points; 
a: = 0 is stable and x = jj. is unstable. These two fixed points coalesce at jx — 0 
and, for /r > 0, a: = 0 is unstable and x = jj, is stable. Thus, an exchange of 
stability has occurred at /r = 0. This type of bifurcation is called a transcritical 
bifurcation. 

End of Example 20.1.2 

Example 20.1.3. Consider the vector field 

X = f{x, ^) = fix — x^ , a; G fiGM.^ 

It is clear that we have 

/( 0 , 0 ) = 0 , 




FIGURE 20.1.3. 

^(0,0) = 0. (20.1.15) 

ox 

Moreover, the fixed points of (20.1.13) are given by 

a; = 0 (20.1.16) 

and 

x'^ = fi (20.1.17) 

and are plotted in Figure 20.1.3. 

Hence, for /i < 0, there is one fixed point, x = 0, which is stable. For fi > 0, 
X = 0 is still a fixed point, but two new fixed points have been created at /i = 0 
and are given by x^ = fi. In the process, x = 0 has become unstable for /r > 0, 
with the other two hxed points stable. This type of bifurcation is called a pitchfork 
bifurcation. 



(20.1.13) 

(20.1.14) 



End of Example 20.1.3 
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Example 20.1.4. Consider the vector field 

X = f{x, fi) = jj, — (20.1.18) 

It is trivial to verify that 

/(0,0) = 0 (20.1.19) 

and 

|^( 0 , 0 ) = 0 . ( 20 . 1 . 20 ) 



X 




FIGURE 20.1.4. 

Moreover, all fixed points of (20.1.18) are given by 

fj, = x^ ( 20 . 1 . 21 ) 



and are shown in Figure 20.1.4. 

However in this example, despite (20.1.19) and (20.1.20), the dynamics of 
(20.1.18) are qualitatively the same for /i > 0 and /i < 0. Namely, (20.1.18) 
possesses a unique, stable fixed point. 



End of Example 20.1.4 



20.1b What Is A “Bifurcation of a Fixed Point”? 

The term “bifurcation” is extremely general. We will begin to learn its 
uses in dynamical systems by understanding its use in describing the orbit 
structure near nonhyperbolic fixed points. Let us consider what we learned 
from the previous examples. 

In all four examples we had 



/( 0 , 0 ) = 0 
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|(0,0) = 0. 

and yet the orbit structure near /x = 0 was different in all four cases. 
Hence, knowing that a fixed point has a zero eigenvalue for /i = 0 is not 
sufficient to determine the orbit structure for A near zero. Let us consider 
each example individually. 

1. {Example 20.1.1). In this example a unique curve (or branch) of fixed 
points passed through the origin. Moreover, the curve lay entirely on 
one side of /i = 0 in the fi — x plane. 

2. {Example 20.1.2). In this example two curves of fixed points inter- 
sected at the origin in the ji — x plane. Both curves existed on either 
side of /X = 0. However, the stability of the fixed point along a given 
curve changed on passing through /x = 0. 

3. {Example 20.1.3). In this example two curves of fixed points inter- 
sected at the origin in the p. — x plane. Only one curve {x = 0) 
existed on both sides of /x = 0; however, its stability changed on 
passing through /x = 0. The other curve of fixed points lay entirely 
to one side of ^ = 0 and had a stability type that was the opposite 
of X = 0 for ^ > 0. 

4. {Example 20.1.4). This example had a unique curve of fixed points 
passing through the origin in the p — x plane and existing on both 
sides of /X = 0. Moreover, all fixed points along the curve had the same 
stability type. Hence, despite the fact that the fixed point {x,p) = 
(0, 0) was nonhyperbolic, the orbit structure was qualitatively the 
same for all p. 

We want to apply the term “bifurcation” to Examples 20.1.1, 20.1.2, and 
20.1.3 but not to Example 20.1.4 to describe the change in orbit structure 
as p passes through zero. We are therefore led to the following definition. 

Definition 20.1.1 (Bifurcation of a Fixed Point) A fixed point {x,p) 
= (0,0) of a one-parameter family of one-dimensional vector fields is said 
to undergo a bifurcation at p = 0 if the flow for p near zero and x near 
zero is not qualitatively the same as the flow near x = 0 at p = 0. 

Several remarks are now in order concerning this definition. 

Remark 1. The phrase “qualitatively the same” is a bit vague. It can be 
made precise by substituting the term “C°-equivalent” (cf. Section 19.12), 
and this is perfectly adequate for the study of the bifurcation of fixed 
points of one- dimensional vector fields. However, we will see that as we 
explore higher dimensional phase spaces and global bifurcations, how to 
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make mathematically precise the statement “two dynamical systems have 
qualitatively the same dynamics” becomes more and more ambiguous. 

Remark 2. Practically speaking, a fixed point (xg, /io) of a one-dimensional 
vector field is a bifurcation point if either more than one curve of fixed 
points passes through (xo,Mo) in the fi — x plane or if only one curve of 
fixed points passes {xg, fig) in the /r — x plane; then it (locally) lies entirely 
on one side of the line /x = /io in the fi — x plane. 

Remark 3. It should be clear from Example 20.1.4 that the condition that 
a fixed point is nonhyperbolic is a necessary but not sufficient condition for 
bifurcation to occur in one-parameter families of vector fields. 

We next turn to deriving general conditions on one-parameter families 
of one-dimensional vector fields which exhibit bifurcations exactly as in 
Examples 20.1.1, 20.1.2, and 20.1.3. 



20.1c The Saddle-Node Bifurcation 

We now want to derive conditions under which a general one-parameter 
family of one-dimensional vector fields will undergo a saddle-node bifurca- 
tion exactly as in Example 20.1.1. These conditions will involve derivatives 
of the vector field evaluated at the bifurcation point and are obtained by 
a consideration of the geometry of the curve of fixed points in the ^ — x 
plane in a neighborhood of the bifurcation point. 

Let us recall Example 20.1.1. In this example a unique curve of fixed 
points, parameterized by x, passed through (/x, a;) = (0,0). We denote the 
curve of fixed points by /x(a;) . The curve of fixed points satisfied two prop- 
erties. 

1. It was tangent to the line ^ = 0 at a; = 0, i.e., 

^( 0 ) = 0 . ( 20 . 1 . 22 ) 

ax 

2. It lay entirely to one side of ^ = 0. Locally, this will be satisfied if we 
have 

^(0) ^ 0. (20.1.23) 

Now let us consider a general, one-parameter family of one-dimensional 
vector fields. 

i = /(x,/x), a; G ^ G (20.1.24) 

Suppose (20.1.24) has a fixed point at {x,qi) = (0,0), i.e.. 



/( 0 , 0 )= 0 . 



(20.1.25) 
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Furthermore, suppose that the fixed point is not hyperbolic, i.e., 

.26) 
.27) 

then, by the implicit function theorem, there exists a unique function 



Now, if we have 



^( 0 , 0 ) = 0 . 



^( 0 . 0 ) ,^ 0 . 



(20.1 

(20.1 



= ^i{0) = 0 (20.1.28) 

defined for x sufficiently small such that f{x,ii{x)) = 0. (Note: the reader 
should check that (20.1.27) holds in Example 20.1.1.) Now we want to 
derive conditions in terms of derivatives of / evaluated at (/i, x) = (0, 0) so 
that we have 

^(0) = 0, (20.1.29) 

ax 

^(0)^ 0. (20.1.30) 

Equations (20.1.29) and (20.1.30), along with (20.1.25), (20.1.26), and 
(20.1.27), imply that (/r, x) = (0, 0) is a bifurcation point at which a saddle- 
node bifurcation occurs. 

We can derive expressions for (20.1.29) and (20.1.30) in terms of deriva- 
tives of / at the bifurcation point by implicitly differentiating / along the 
curve of fixed points. 

Using (20.1.27), we have 



f(x,fi(x)) = 0. 

Differentiating (20.1.31) with respect to x gives 

= 0 = M. 

Evaluating (20.1.32) at (/r,a;) = (0,0), we obtain 



1^(01 - 



thus we see that (20.1.26) and (20.1.27) imply that 



d/i 

dx 



( 0 ) = 0 , 



(20.1.31) 



(20.1.32) 



(20.1.33) 



(20.1.34) 



i.e., the curve of fixed points is tangent to the line /i = 0 at a; = 0. 
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X 




a) (-0(O,O)/|^(O,O)) >0; b) (-0(0, 0)/|£ (0, 0)) < 0. 



Next, let us differentiate (20.1.32) once more with respect to x to obtain 



= 0 = ^{x,fJ,{x))+2^-J-{x,^i{x))^{x) 
dx^ ox^ oxofi dx 



\ dx 

Evaluating (20.1.35) at {^i,x) = (0,0) and using (20.1.33) gives 

£( 0 , 0 ,.|( 0 , 0 , g , 0).0 



(20.1.35) 



or 



‘liL(n\ - 3i» 

|(0,0) ' 

Hence, (20.1.36) is nonzero provided we have 



(20.1.36) 
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Let us summarize. In order for (20.1.24) to undergo a saddle-node bifur- 
cation we must have 



nonhyperbolic fixed point (20.1.38) 



df 

^(0,0) ^ 0, (20.1.39) 

^(0,0) ^ 0. (20.1.40) 

Equation (20.1.39) implies that a unique curve of fixed points passes 
through (^,x) = (0,0), and (20.1.40) implies that the curve lies locally on 
one side of /i = 0. It should be clear that the sign of (20.1.36) determines 
on which side of /i = 0 the curve lies. In Figure 20.1.5 we show both cases 
without indicating stability and leave it as an exercise for the reader to 
verify the stability types of the different branches of fixed points emanating 
from the bifurcation point. 

Let us end our discussion of the saddle-node bifurcation with the follow- 
ing remark. Consider a general one-parameter family of one-dimensional 
vector fields having a nonhyperbolic fixed point at (a;,/x) = (0,0). The 
Taylor expansion of this vector field is given as follows 



/( 0 , 0 ) = 0 ' 



f{x, fi) = aofi + a\x^ + + 0(3). (20.1.41) 



Our computations show that the dynamics of (20.1.41) near (/x,a;) = (0,0) 
are qualitatively the same as one of the following vector fields 

X = ^±x^. (20.1.42) 



Hence, (20.1.42) can be viewed as the normal form for saddle-node bifur- 
cations. 

This brings up another important point. In applying the method of nor- 
mal forms there is always the question of truncation of the normal form; 
namely, how are the dynamics of the normal form including only the 0{k) 
terms modified when the higher order terms are included? We see that, 
in the study of the saddle-node bifurcation, all terms of 0(3) and higher 
could be neglected and the dynamics would not be qualitatively changed. 
The implicit function theorem was the tool that enabled us to verify this 
fact. 



20. Id The Transcritical Bifurcation 

We want to follow the same strategy as in our discussion and derivation 
of general conditions for the saddle-node bifurcation given in the previous 
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section, namely, to use the implicit function theorem to characterize the 
geometry of the curves of fixed points passing through the bifurcation point 
in terms of derivatives of the vector field evaluated at the bifurcation point. 

For the example of transcritical bifurcation discussed in Example 20.1.2, 
the orbit structure near the bifurcation point was characterized as follows. 

1. Two curves of fixed points passed through (x,/i) = (0,0), one given 
hy X = /jL, the other by a; = 0. 

2. Both curves of fixed points existed on both sides of /r = 0. 

3. The stability along each curve of fixed points changed on passing 
through fj, = 0. 

Using these three points as a guide, let us consider a general one-parameter 
family of one-dimensional vector fields 

i = /(x,/r), a: G ^ G (20.1.43) 



We assume that at (a;, fi) = (0, 0), (20.1.43) has a nonhyperbolic fixed point, 
i.e., 

/(0,0) = 0 (20.1.44) 



and 



df 

^(0,0) = 0. (20.1.45) 

Now, in Example 20.1.2 we had two curves of fixed points passing through 
(fj,,x) = (0,0). In order for this to occur it is necessary to have 



dl 



( 0 , 0 ) = 0 , 



(20.1.46) 



or else, by the implicit function theorem, only one curve of fixed points 
could pass through the origin. 

Equation (20.1.46) presents a problem if we wish to proceed as in the 
case of the saddle-node bifurcation; in that situation we used the condition 
^(0,0) yf 0 in order to conclude that a unique curve of fixed points, 
/a(x), passed through the bifurcation point. We then evaluated the vector 
field on the curve of fixed points and used implicit differentiation to derive 
local characteristics of the geometry of the curve of fixed points based on 
properties of the derivatives of the vector field evaluated at the bifurcation 
point. However, if we use Example 20.1.2 as a guide, we can extricate 
ourselves from this difficulty. 

In Example 20.1.2, a; = 0 was a curve of fixed points passing through the 
bifurcation point. We will require that to be the case for (20.1.43), so that 
(20.1.43) has the form 



i = = xF{x,iJ,), 



a; G ^ G 



(20.1.47) 
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where, by definition, we have 



F{x,n) 



X ’ 



X ^ 0 



§1(0, m), 




(20.1.48) 



Since a; = 0 is a curve of fixed points for (20.1.47), in order to obtain an 
additional curve of fixed points passing through (fj,,x) = (0,0) we need 
to seek conditions on F whereby F has a curve of zeros passing through 
(/X, x) = (0, 0) (that is not given by a; = 0). These conditions will be in terms 
of derivatives of F which, using (20.1.48), can be expressed as derivatives 
of /. 

Using (20.1.48), it is easy to verify the following 



F(0,0) = 0, 



and (most importantly) 






(20.1.49) 

(20.1.50) 

(20.1.51) 

(20.1.52) 



Now let us assume that (20.1.52) is not zero; then by the implicit function 
theorem there exists a function, fj,{x), defined for x sufficiently small, such 
that 



F{x,fi{x)) = 0. 



(20.1.53) 



Clearly, /i(x) is a curve of fixed points of (20.1.47). In order for fi{x) to not 
coincide with a; = 0 and to exist on both sides of /a = 0, we must require 
that 



0 < 




< oo. 



Implicitly differentiating (20.1.53) exactly as in the case of the saddle-node 
bifurcation we obtain 



dx^ > f(0,0) • 



(20.1.54) 



Using (20.1.49), (20.1.50), (20.1.51), and (20.1.52), (20.1.54) becomes 



dfj, 

dx 



(0) 



-§^( 0 , 0 ) 

Si(0,0)' 



(20.1.55) 
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FIGURE 20.1.6. 



a) (-0(O,O)/|^(O,O)) >0; b) (-0(0, 0)/|^(0, 0)) < 0. 



We now summarize our results. In order for a vector field 
x = f{x,fi), a; G ya G 

to undergo a transcritical bifurcation, we must have 



/( 0 , 0 ) = 0 

g ( 0 , 0)=0 



nonhyperbolic fixed point 



(20.1.56) 



(20.1.57) 



|( 0 , 0 ) = 0 . 



(20.1.58) 



(20.1.59) 



(20.1.60) 



We note that the slope of the curve of fixed points not equal to x = 0 is 
given by (20.1.55). These two cases are shown in Figure 20.1.6; however, we 
do not indicate stabilities of the different branches of fixed points. We leave 
it as an exercise to the reader to verify the stability types of the different 
curves of fixed points emanating from the bifurcation point. 
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Thus, (20.1.57), (20.1.58), (20.1.59), and (20.1.60) show that the orbit 
structure near (x, fi) = (0, 0) is qualitatively the same as the orbit structure 
near (x, /r) = (0, 0) of 

i = /ix =F . (20.1.61) 

Equation (20.1.61) can be viewed as a normal form for the transcritical 
bifurcation. 



20. lE The Pitchfork Bifurcation 



The discussion and derivation of conditions under which a general one- 
parameter family of one-dimensional vector fields will undergo a bifurcation 
of the type shown in Example 20.1.3 follows very closely our discussion of 
the transcritical bifurcation. 

The geometry of the curves of fixed points associated with the bifurcation 
in Example 20.1.3 had the following characteristics. 

1. Two curves of fixed points passed through (/x,x) = (0,0), one given 
by a; = 0, the other by /i = 

2. The curve a; = 0 existed on both sides of /a = 0; the curve /a = a;^ 
existed on one side of /a = 0. 

3. The fixed points on the curve a; = 0 had different stability types on 
opposite sides of ^ = 0. The fixed points on /a = a:^ all had the same 
stability type. 



Now we want to consider conditions on a general one-parameter family 
of one-dimensional vector fields having two curves of fixed points passing 
through the bifurcation point in the /j, — x plane that have the properties 



given above. 




We denote the vector field by 




x = f{x,fi), a; G ya G 


(20.1.62) 


and we suppose 




/(0,0)=0, 


(20.1.63) 


|,0,0, = 0 . 


(20.1.64) 



As in the case of the transcritical bifurcation, in order to have more than 
one curve of fixed points passing through (/a, a;) = (0, 0) we must have 



df 

^(0,0) = 0. (20.1.65) 

Proceeding further along these lines, we require a; = 0 to be a curve of fixed 
points for (20.1.62) by assuming the vector field (20.1.62) has the form 



X = xF{x, /a). 



( 20 . 1 . 66 ) 
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where 



g(o.rt. 



X ^ 0 



X = 0 



(20.1.67) 



In order to have a second curve of fixed points passing through (/r, x) = 
(0, 0) we must have 

i^(0,0) = 0 (20.1.68) 



with 

dF 

— (0,0)^ 0. (20.1.69) 

Equation (20.1.69) insures that only one additional curve of fixed points 
passes through (/r, a;) = (0,0). Also, using (20.1.69), the implicit function 
theorem implies that for x sufficiently small there exists a unique function 
/x(x) such that 

F{x,^i{x)) = 0. (20.1.70) 



In order for the curve of fixed points, ^{x), to satisfy the above-mentioned 
characteristics, it is sufficient to have 



^(0)=0 (20.1.71) 

ax 

and 

§(0)^ 0. (20.1.72) 

The conditions for (20.1.71) and (20.1.72) to hold in terms of the deriva- 
tives of F evaluated at the bifurcation point can be obtained via implicit 
differentiation of (20.1.70) along the curve of fixed points exactly as in the 
case of the saddle-node bifurcation. They are given by 



^ f (0,0) 



= 0 



and 



dx^^ ^ f (0,0) 



^0. 



(20.1.73) 



(20.1.74) 



Using (20.1.67), (20.1.73) and (20.1.74) can be expressed in terms of deriva- 
tives of / as follows 



and 



^ 0 . 

S:(0.0) ^ 



(20.1.75) 



(20.1.76) 
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We summarize as follows. In order for the vector field 

i = /(x,/r), x G ^ G (20.1.77) 

to undergo a pitchfork bifurcation at (x, /r) = (0,0), it is sufficient to have 



/( 0 , 0 ) = 0 ) 



dl 

dx 



(0,0) 



0 



nonhyperbolic fixed point 



(20.1.78) 




(a) 




(b) 



FIGURE 20.1.7. 



with 



0)) >0; b) (-0(0,0)/!^ (0,0)^ 


) <0. 


|(0,0) = 0. 


(20.1.79) 




(20.1.80) 




(20.1.81) 


^(0,0) ^0- 


(20.1.82) 



There are two possibilities for the disposition of the two branches of fixed 
points depending on the sign of (20.1.76). These two possibilities are shown 
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in Figure 20.1.7 without indicating stabilities. We leave it as an exercise for 
the reader to verify the stability types for the different branches of fixed 
points emanating from the bifurcation point. 

We conclude by noting that (20.1.78), (20.1.79), (20.1.80), (20.1.81), and 
(20.1.82) imply that the orbit structure near (a;,/x) = (0,0) is qualitatively 
the same as the orbit structure near (x, y) = (0, 0) in the vector field 

x = ^x^x^. (20.1.83) 

Thus, (20.1.83) can be viewed as a normal form for the pitchfork bifurca- 
tion. 

20. If Exercises 

1. In our development of the transcritical and pitchfork bifurcations we assumed that 
X — 0 was a trivial solution. Was this necessary? In particular, would the conditions 
for transcritical and pitchfork bifurcations change if this were not the case? 

2. Consider a (r > 1) autonomous vector field on IR^ having precisely two hyperbolic 
fixed points. Can you infer the nature of the stability of the two fixed points? How 
does the situation change if one of the fixed points is not hyperbolic? Can both fixed 
points be nonhyperbolic? Construct explicit examples illustrating each situation. 

3. Consider the saddle-node bifurcation for vector fields and Figure 20.1.5. For the case 
(-|^(0.0)/|^(0,0)) > 0, give conditions under which the upper part of the curve 
of fixed points is stable and the lower part is unstable. Alternatively, give conditions 
under which the upper part of the curve of fixed points is unstable and the lower part 
is stable. 

Repeat the exercise for the case 0)/ ^ (0; 0)^ < 0- 

4. Consider the transcritical bifurcation for vector fields and Figure 20.1.6. For the case 

(-1^(0. (0.0)) > 0, give conditions for a: — 0 to be stable for /a > 0 and 

unstable for /a < 0. Alternatively, give conditions for a: = 0 to be unstable for ^ > 0 
and stable for /i < 0. 

Repeat the exercise for the case (0, 0)/ < 0- 

5. Consider the pitchfork bifurcation for vector fields and Figure 20.1.7. For the case 

(-1^(0. (0.0)) > 0. give conditions for a: — 0 to be stable for /a > 0 and 

unstable for /a < 0. Alternatively, give conditions for a: = 0 to be unstable for ^ > 0 
and stable for /a < 0. 

Repeat the exercise for the case (0, 0)/ < 0- 

6. In Exercise 4 following Chapter 18 we computed center manifolds near the origin for 
the following one-parameter families of vector fields. Describe the bifurcations of the 
origin. In, for example, a) and a^) the parameter e multiplies a linear and nonlinear 
term, respectively. In terms of bifurcations, is there a qualitative difference in the two 
cases? What kinds of general statements can you make? 

a) 6 = -e + sv + v'^, (e.«)GS^xRk 

V — — sin 9, 

6 = —9 + v'^ + ev^ , 

V — — sin 9. 
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b) 

b') 

d) 

d') 

f) 

f') 

h) 

h') 



i) 



i') 



j) 

j') 

k) 
k') 

l) 

1 ') 



m) 



m') 



• ^ 2 
X = -x + y + x y, 

y = X + 2y + ey + y'^ , 



(x, y) G R^. 



X = ^x + y + X y, 
y = x + 2y + y^ + ey'^ 



X — 2x 2y sy, 
y — X y x"^ , 



(x, y) e : 



X — 2x -\- 2y, 
y — x + y + x"^ + ey^. 

x = -2x + 3y + sx + y^, y) g r2, 

y ^ 2x - 3y x-^ , 

X — —2x + 3y + 
y — 2a: — 3y + 



X — —X + y, 

y ^ + 2a: + ey, 

X — —X + y + ea:^, 
y — — e + e + 2a:. 



(x, y) G R^. 



X — —2x + y + 2 + ea: + y^ 2 , 
y = x — 2y + z-\-ex + xz^ , (a:, y, z) G 

z =■ X y — 2z ex x^y, 



X — — 2 x + y + 2 + sx'^ + y^2, 
y — X — 2 y-\-z-\- exy + xz'^ , 
z — x-\-y — 2 z-\- x^y. 

a: ^ -a: - y + 2^, ^ 

y — 2a: + y + ey — 2^, ?/? -2) £ K. . 

2 = X + 2y — 2, 



I 2,2 

X — —X — y + ex + 2 , 
y = 2x + y — 2 ^ + ey^, 
2 = X + 2y — 2 . 



X — —X — y — 2 + ex — yz, 
y — —X — y — z — xz, (x, y, 2) G 

2 = —X — y — z — yz, 

X — —X — y — z — yz ex^, 
y = —X — y — z — xz, 
z — —X — y — z — xy. 

i^y + a:"+£y, 
y ^ -y - X , 

X = y + X + ey , 

y = -y - ■ 

2 , 

X — X + ey, / N _ 10,2 
^2 y) e R . 

y ^ -y - X , 

2 I 2 
X — X + ey , 

y y - x^. 
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7. Center Manifolds at a Saddle-node Bifurcation Point for Vector Fields 

In developing the center manifold theory for parametrized families of vector fields, we 
dealt with equations of the following form 



X — Ax + f{x, y, s), 
V = By + g(x,y,e), 



{x, y, e) G X IR^ x ]R^, 



(20.1.84) 



where A is a c X c matrix whose eigenvalues all have zero real parts, B is an s X s 
matrix whose eigenvalues all have negative real parts, and 



/(0,0,0) = 0, D/(0,0,0) = 0, 

9(0, 0,0) = 0, D9(0,0,0) = 0. 



(20.1.85) 



The conditions Df{0, 0, 0) = 0, Dg{0, 0, 0) = 0 do not allow for terms that are linear in 
the parameter s. Clearly, this may not be the case at a saddle-node bifurcation point, 
and we want to consider this issue in this exercise. Although this could have been 
done in Chapter 18, in that chapter we were introducing only center manifold theory 
and were not really concerned with bifurcations. In this case the form of the equations 
given by (20.1.84) and (20.1.85) was the “cleanest and quickest” way to introduce the 
notion of parametrized families of center manifolds. 

We will start at a very basic level. Consider the (r as large as necessary) vector 
field 

i = F{z,e), (z,e) e X R*”. (20.1.86) 

Suppose that {z,e) — (0,0) is a fixed point of (20.1.86) at which the matrix 



D^F(0,0) 



(20.1.87) 



has c eigenvalues with zero real parts and s eigenvalues with negative real parts. 
Our goal is to apply the center manifold theory in order to examine the dynamics 
of (20.1.86) near (z,e) — (0,0). 

We rewrite Equation (20.1.86) as follows 

i ^ D^F{0, 0)z -F DeF{0, 0)e -+■ G{z, s), (20.1.88) 

where 

G(z,e)= [f(z,s) - D^F(0,0)z - Dei^(0, 0)e] = 0(2) (20.1.89) 

in z and s. Note that the term “Z)gE'(0, 0)e” in (20.1.88) is the new wrinkle — it was 
zero under our previous assumptions. For notational purposes we let 

DzF{0, 0) = M — (c s) X {c s) matrix, 

D^F{0, 0) = A — (c s) X p matrix, 

so that (20.1.88) becomes 

z ^ Mz-\- Ae-\- G{z, e). (20.1.90) 

Now let T be the (s-|-c) X (s-^c) matrix that puts M into the following block diagonal 
form 

T'^MT=(q (20.1.91) 

where A is a (c X c) matrix with all eigenvalues having zero real parts and B is an 
(s X s) matrix with all eigenvalues having negative real parts. If we let 

z^Tw, (a:,y) G R*" X (20.1.92) 

where w = (x, y), and apply this linear transformation to (20.1.90), we obtain 



) 



(5 




+ Ae + 



( f{x,y,e)\ 
\g(x,y,e) ) ’ 



(20.1.93) 



where 



A = T"^A, 
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( f(x,y,e)\ 

V V, e) J 



T-^G(T(x,v),e). 



Note that /(O, 0, 0) = 0, ^(0, 0, 0) = 0, Z)/(0, 0, 0) = 0, and 0, 0) = 0. Next, let 




where Ac corresponds to the first c rows of A, and As corresponds to the last s rows 
of A. Then (20.1.93) can be rewritten as 

(a Ac o\ f x\ f f(x,v,s)\ 

OJ) 0 ^ + ° • (20.1.94) 

\0 Ac bJ \y/ \g(x,y,s)J 

The reader should recognize that (20.1.94) is “almost” in the standard normal form for 
application of the center manifold theory. The final step would be to introduce a linear 
transformation that block diagonalizes the linear part of (20.1.94) into a (c + p) X (c+p) 
matrix with eigenvalues all having zero real parts (and p identically zero) and an (s X s) 
matrix with all eigenvalues having negative real parts. 

a) Carry out this final step and discuss applying the center manifold theorem to 
the resulting system. In particular, do the relevant theorems from Chapter 18 
go through? 




Before we work out some specific problems, let us first answer an example. 
Consider the vector field 



x-e + x +y , {x,y,e)<^v?. (20.1.95) 

y = -y + X , 



It should be clear that ((c,p,e) = (0,0,0) is a fixed point of (20.1.95). We want to 
study the orbit structure near this fixed point for € small. Rewriting (20.1.95) in the 
form of (20.1.94) gives 



0 10 
= (00 0 

,0 0 -1 



0 



We seek a center manifold of the form 

h{x^e) — ax^ + bxs + cs^ + 0(3) . 



(20.1.96) 



Utilizing the usual procedure for calculating the center manifold, we obtain 
h(a:,e) = x'^ — 2xe + 2s^ + 0(3) . 

The vector field restricted to the center manifold is then given by 

X — e + x“^ + 0(4), 

e = 0. 

Hence, a saddle-node bifurcation occurs at £ = 0. 

Now consider the following vector fields 



X — s X y , 

y = -y + . 

X — s X — y t 
y - € - y + x~^. 

I I 2 

X — £ -f- £X -f- X , 

y = -y + ■ 



(x, y, e) G R^. 
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I I 2 

, X — S £X X , 

e) . 2 

y ^ £ - y + X . 

1 3 

X — £+-^x + y-\-x , 

y ^ X -\-2y - xy. 

, a: — 2e + 2a: + 2y, 

® y = £ + a; + y + y^. 

— 2a: + 2y — a:^ , 
y ^2x -2y. 

X — £ — 2x y z yz, 
i) y — X — 2y-\-z-\- zx^ (a:, y, 2, e) G 

i = a: + y— 22 + xy. 

For each vector field, construct the center manifold and discuss the dynamics near the 
origin for e small. What types of bifurcations occur? 

8. Consider the vector field 



a: — e + a:^ + 

2) = -y + 



(a:, y, e) G 



For this vector field the tangent space approximation is sufficient for approximating 
the center manifold of the origin. Verify this statement and discuss conditions under 
which the tangent space approximation might work in general. Consider your ideas in 
the context of the following examples. 

, 2,2 
X — £x X +y, 

y ^ -y + x^. 

a: ^ £ + a:^ + a:y, 
y = -y + a: . 



a: ^ £ + a:y + y^, 
y = -y + a: . 



9. Consider the block diagonal “normal form” of (20.1.84) to which we first transformed 
the vector field in order to apply the center manifold theory. Discuss why (or why 
not) this preliminary transformation was necessary. Is this preliminary transformation 
necessary for equations of the form of (20.1.94) in order to apply the center manifold 
theory? Work out several examples to support your views and illustrate the relevant 
points. {Hint: consider the coordinatization of the center manifold and how the invari- 
ance condition is manifested in those coordinates.) 

10. Consider the following one-parameter family of two-dimensional (r as large as 
necessary) vector fields 



X = f {x; n) , {x^ fi) ^ X , 

where /(0;0) = 0 and Dxf{0,0) has a zero eigenvalue and a negative eigenvalue. 
Suppose the vector field has the following symmetry 

f(x, fi) = -f(-x, n). 

What can you then conclude concerning the symmetry of the vector field restricted 
to the center manifold for x and fi small? Can the vector field undergo a saddle-node 
bifurcation at {x,fi) — (0,0)? Can the vector field undergo a saddle-node bifurcation 
at other points (a:, fi) G R^ X R^? 
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20.2 A Pure Imaginary Pair of Eigenvalues: The 
Poincare- Andronov-Hopf Bifurcation 

We now turn to the next most simple way that a fixed point can be nonhy- 
perbolic; namely, that the matrix associated with the vector field linearized 
about the fixed point has a pair of purely imaginary eigenvalues, with the 
remaining eigenvalues having nonzero real parts. Let us be more precise. 
Recall (20.0.1), which we restate here; 

y = g{yA), 2/gR”, Agm^, (20.2.1) 

where g is C’’ (r > 5) on some sufficiently large open set containing the 
fixed point of interest. The fixed point is denoted by {y, A) = (j/q, Aq), i.e., 

0 = Ayo,Ao). (20.2.2) 

We are interested in how the orbit structure near yo changes as A is varied. 
In this situation the first thing to examine is the linearization of the vector 
field about the fixed point, which is given by 

i = Dyg{yo,Xo)^, ^ G »"• (20.2.3) 

Suppose that Dyg{yQ, Xq) has two purely imaginary eigenvalues with the 
remaining n — 2 eigenvalues having nonzero real parts. We know (cf. 
the remarks at the beginning of this chapter) that since the fixed point 
is not hyperbolic, the orbit structure of the linearized vector field near 
(j/. A) = (yo)Ao) may reveal little (and, possibly, even incorrect) informa- 
tion concerning the nature of the orbit structure of the nonlinear vector 
field (20.2.1) near (y,X) = (yo,Xo). 

Fortunately, we have a systematic procedure for analyzing this problem. 
By the center manifold theorem, we know that the orbit structure near 
{y, A) = {yo, Ao) is determined by the vector field (20.2.1) restricted to the 
center manifold. This restriction gives us a p-parameter family of vector 
fields on a two-dimensional center manifold. For now we will assume that 
we are dealing with a single, scalar parameter, i.e., p = 1. If there is more 
than one parameter in the problem, we will consider all but one of them as 
fixed. 

On the center manifold the vector field (20.2.1) has the following form 

/Re A(p) -Im A(p) \ ( x\ f f^{x,y,y)\ 

\y ) \lmX{fj.) Re X{y) ) \y J \p{x,y,y)J' 

(x,y,g) G X X (20.2.4) 

where and are nonlinear in x and y and A(/i), A(p) are the eigenvalues 
of the vector field linearized about the fixed point at the origin. 

Equation (20.2.4) was first discussed in Section 19.2. The reader should 
recall that in performing the center manifold reduction to obtain (20.2.4), 
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several preliminary steps were first implemented. Namely, first we trans- 
formed the fixed point to the origin and, then, if necessary, performed a 
linear transformation of the coordinates so that the vector field (20.2.1) 
was in the form of (20.2.4). We further remark that the eigenvalue, de- 
noted A(^), should not be confused with the general vector of parameters 
in (20.2.1), denoted A G which we subsequently restricted to a scalar 
and labeled /i. We will henceforth denote 

X{^) = a{fj,) + (20.2.5) 

and note that by our assumptions we have 

a(0) = 0, 

u;(0)yf0. (20.2.6) 

The next step is to transform (20.2.4) into normal form. This was done in 
Section 19.2. The normal form was found to be 

X = a{^j,)x - + {a{y,)x - b{y)y){x'^ + y'^) + 0{\x\^, |y|®), 

y = uj{y)x + a{y)y + {b{y)x + a{y)y){x'^ + y'^) + 0{\x\^, |y|®). 

(20.2.7) 

We will find it more convenient to work with (20.2.7) in polar coordinates. 
In polar coordinates (20.2.7) is given by 

r = a{y)r + a{y)r^ + 0{r^), 

e = uj{n) + b{ny + 0{r^). ( 20 . 2 . 8 ) 

Because we are interested in the dynamics near /x = 0, it is natural to 
Taylor expand the coefficients in (20.2.8) about /i = 0. Equation (20.2.8) 
thus becomes 



r = a'{Q)yr + a(0)r^ -I- 

6 = w(0) -I- w'(0)/x -I- 6(0)r^ -I- r^), (20.2.9) 

where “ / ” denotes differentiation with respect to y and we have used the 
fact that a(0) = 0. 

Our goal is to understand the dynamics of (20.2.9) for r small and /x 
small. This will be accomplished in two steps. 

Step 1. Neglect the higher order terms of (20.2.9) and study the resulting 
“truncated” normal form. 

Step 2. Show that the dynamics exhibited by the truncated normal form 
are qualitatively unchanged when one considers the influence of the 
previously neglected higher order terms. 
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Step 1. Neglecting the higher order terms in (20.2.9) gives 

f = dfir + ar^, 

9 = u + cn + br^, ( 20 . 2 . 10 ) 

where, for ease of notation, we define 

a'(0) = d, 
o(0) = a, 
w(0) = to, 
w'(0) = c, 

6 ( 0 ) = 6 . ( 20 . 2 . 11 ) 



In analyzing the dynamics of vector fields we have always started with the 
simplest situation; namely, we have found the fixed points and studied the 
nature of their stability. In regard to (20.2.10), however, we proceed slightly 
differently because of the nature of the coordinate system. To be precise, 
values of r > 0 and /i for which r = 0, but 6^0, correspond to periodic 
orbits of (20.2.10). We highlight this in the following lemma. 

Lemma 20.2.1 For — oo < ^ < 0 and p, sujficiently small 



{r{t),9{t)) 






( 20 . 2 . 12 ) 



is a periodic orbit for (20.2.10). 

Proof: In order to interpret (20.2.12) as a periodic orbit, we need only 
to insure that 0 is not zero. Since w is a constant independent of p, this 
immediately follows by taking p sufficiently small. □ 



We address the question of stability in the following lemma. 

Lemma 20.2.2 The periodic orbit is 

i) asymptotically stable for a < 0; 

ii) unstable for a > 0. 

Proof: The way to prove this lemma is to construct a one-dimensional 
Poincare map along the lines of Chapter 10 (and in particular. Example 
10.1.1), from which the results of this lemma follow. □ 



We note that since we must have r > 0, (20.2.12) is the only periodic 
orbit possible for (20.2.10). Hence, for /i yf 0, (20.2.10) possesses a unique 
periodic orbit having amplitude 0{y/p). Concerning the details of stability 
of the periodic orbit and whether it exists for /r > 0 or /x < 0, from (20.2.12) 
it is easy to see that there are four possibilities: 
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4. fi < 0, a < 0. 

We will examine each case individually; however, we note that in all cases 
the origin is a fixed point which is 

stable at /X = 0 for a < 0, 

unstable at /x = 0 for a > 0. 

Case 1: d > 0, a > 0. In this case the origin is an unstable fixed point for 
/X > 0 and an asymptotically stable fixed point for /x < 0, with an unstable 
periodic orbit for /x < 0 (note: the reader should realize that if the origin 
is stable for <0, then the periodic orbit should be unstable); see Figure 
20 . 2 . 1 . 

Case 2: d > 0, a < 0. In this case the origin is an asymptotically stable 
fixed point for /x < 0 and an unstable fixed point for /x > 0, with an 
asymptotically stable periodic orbit for /x > 0; see Figure 20.2.2. 

Case 3: d < 0, a > 0. In this case the origin is an unstable fixed point for 
/X < 0 and an asymptotically stable fixed point for /x > 0, with an unstable 






^=o 




n>0 



FIGURE 20.2.3. d < 0, a > 0. 
periodic orbit for ^ > 0; see Figure 20.2.3. 

Case 4: d < 0, a < 0. In this case the origin is an asymptotically stable 
fixed point for /x < 0 and an unstable fixed point for /x > 0, with an 
asymptotically stable periodic orbit for /x < 0; see Figure 20.2.4. 
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FIGURE 20.2.4. d < 0, a < 0. 



From these four cases we can make the following general remarks. 

Remark 1. For a < 0 it is possible for the periodic orbit to exist for either 
/X > 0 (Case 2) or < 0 (Case 4); however, in each case the periodic orbit 
is asymptotically stable. Similarly, for a > 0 it is possible for the periodic 
orbit to exist for either /x > 0 (Case 3) or /x < 0 (Case 1); however, in each 
case the periodic orbit is unstable. Thus, the number a tells us whether 
the bifurcating periodic orbit is stable (a < 0) or unstable (a > 0). The 
case a < 0 is referred to as a supercritical bifurcation, and the case a > 0 
is referred to as a subcritical bifurcation. 



Remark 2. Recall that 



d= -(ReA(M)) 



/x=0 



Hence, for d > 0, the eigenvalues cross from the left half-plane to the right 
half-plane as /x increases and, for d < 0, the eigenvalues cross from the right 
half-plane to the left half-plane as /x increases. For d > 0, it follows that the 
origin is asymptotically stable for /x < 0 and unstable for p, > 0. Similarly, 
for d < 0, the origin is unstable for ^ < 0 and asymptotically stable for 
/X > 0. 



Step 2. At this point we have a fairly complete analysis of the orbit structure 
of the truncated normal form near (r, /x) = (0,0). We now must consider 
Step 2 in our analysis of the normal form (20.2.9); namely, are the dynamics 
that we have found in the truncated normal form changed when the effects 
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of the neglected higher order term are considered? Fortunately, the answer 
to this question is no and is the content of the following theorem. 

Theorem 20.2.3 (Poincare- Andronov-Hopf Bifurcation) Consider 
the full normal form (20.2.9). Then, for ^ sufficiently small, Case 1, Case 
2, Case 3, and Case ) described above hold. 

Proof: We will outline a proof that uses the Poincare-Bendixson Theorem. 
We begin by considering the truncated normal form (20.2.10) and the case 
a < 0, d > 0. In this case the periodic orbit is stable and exists for fi > 0, 
and the r coordinate is given by 



r = 





FIGURE 20.2.5. 

We next choose /i > 0 sufficiently small and consider the annulus in the 
plane, A, given by 

A = {(r, 6»)| ri < r < r 2 } , 
where ri and r 2 are chosen such that 



0 < ri < Y ~~ < ’"2- 

By (20.2.10), it is easy to verify that on the boundary of A, the vector 
field given by the truncated normal form (20.2.10) is pointing strictly into 
the interior of A. Hence, A is a positive invariant region (cf. Definition 
3.0.3, Chapter 3); see Figure 20.2.5. 

It is also easy to verify that A contains no fixed points so, by the Poincare- 
Bendixson theorem, A contains a stable periodic orbit. Of course we already 
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knew this; our goal is to show that this situation still holds when the full 
normal form (20.2.9) is considered. 

Now consider the full normal form (20.2.9). By taking fx and r sufficiently 
small, the fxr^ ,r^) terms can be made much smaller than the rest 

of the normal form (i.e., the truncated normal form (20.2.10). Therefore, 
by taking ri and T 2 sufficiently small, A is still a positive invariant region 
containing no fixed points. Hence, by the Poincare-Bendixson theorem, A 
contains a stable periodic orbit. The remaining three cases can be treated 
similarly; however, in the cases where a > 0, the time-reversed flow (i.e., 
letting t — >■ —t) must be considered. □ 

To apply this theorem to specific systems, we need to know d (which is 
easy) and a. In principle, a is relatively straightforward to calculate. We 
simply carefully keep track of the coefficients in the normal form transfor- 
mation in terms of our original vector field. However, in practice, the alge- 
braic manipulations are horrendous. The explicit calculation can be found 
in Hassard, Kazarinoff, and Wan [1980], Marsden and McCracken[1976], 
and Guckenheimer and Holmes [1983]; here we will just state the result. 

At bifurcation (i.e., fj, = 0), (20.2.4) becomes 



X 

y 




\p{x,y,Q)) 



(20.2.13) 



and the coefficient a(0) = a is given by 
1 



16 



+ f 

\_J XXX ' Jx 



xyy 



+ L 



+ f 

xxy ' J 'I 



yyy\ 



1 



+ ^ \A (A +A)_f 

16 w A XX Jyy) Jc 

_ fl f2 I f2 1 

JxxJxx ' JyyJyyi ’ 



(/. 



+ fly) 



(20.2.14) 



where all partial derivatives are evaluated at the bifurcation point, i.e., 
(x,y,y) = (0,0,0). 

We end this section with some historical remarks. Usually Theorem 
20.2.3 goes by the name of the “Hopf bifurcation theorem.” However, as has 
been pointed out repeatedly by V. Arnold [1983], this is inaccurate, since 
examples of this type of bifurcation can be found in the work of Poincare 
[1892]. The first specific study and formulation of a theorem was due to 
Andronov [1929]. However, this is not to say that E. Hopf did not make 
an important contribution; while the work of Poincare and Andronov was 
concerned with two-dimensional vector fields, the theorem due to E. Hopf 
[1942] is valid in n dimensions (note: this was before the discovery of the 
center manifold theorem). For these reasons we refer to Theorem 20.2.3 as 
the Poincare-Andronov-Hopf bifurcation theorem. 
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20.2 a Exercises 

1. This exercise comes from Marsden and McCracken [1976]. Consider the following 
fields 

a) (r,e)GR+xSF 

6 = 1 , 

r = r(fi - r^)(2fi - , 

6 = 1 . 

r = r(r + ^J,)(r - fj,), 

6 = 1 . 

d) r = /rr(r^ - ^l), 

6 = 1 . 




FIGURE 20.2.6. 
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Match each of these vector fields to the appropriate phase portrait in Figure 20.2.6 and 
explain which hypotheses (if any) of the Poincare— Andronov— Hopf bifurcation theorem 
are violated. 

2. Consider the Poincare— Andronov— Hopf bifurcation theorem (Theorem 20.2.3). Work 
out all of the details of the proof outlined that uses the Poincare-Bendixson theorem. 

3. For the Poincare— Andronov-Hopf bifurcation, compute the expression for the coeffi- 
cient a given in (20.2.14). 



20.3 Stability of Bifurcations Under Perturbations 

Let us recall the central motivational question raised at the beginning of 
this chapter; namely, what is the nature of the orbit structure near a non- 
hyperbolic fixed point of a vector field? The key word to focus on in this 
question is “near.” We have seen that a nonhyperbolic fixed point can be 
either asymptotically stable or unstable. However, most importantly, we 
have seen that “nearby vector fields” can have very different orbit struc- 
tures. The phrase “nearby vector fields” was made concrete by considering 
parameterized families of vector fields; at a certain parameter value the 
fixed point was not hyperbolic, and a qualitatively different orbit structure 
existed for nearby parameter values (i.e., new solutions were created as the 
parameter was varied). There is an important, general lesson to be learned 
from this, which we state as follows. 

Pure Mathematical Lesson 

From the point of view of stability of nonhyperbolic fixed points of vector 
fields, one should not only study the orbit structure near the fixed point 
but also the local orbit structure of nearby vector fields. 

Applied Mathematical Lesson 

From the point of view of “robustness” of mathematical models, suppose 
one has a vector field possessing a nonhyperbolic fixed point. The vector 
field should then possess enough (independent) parameters so that, as the 
parameters are varied, all possible local dynamical behavior is realized in 
this particular parameterized family of vector fields. 

Before making these somewhat vague ideas more precise, let us consider 
how they are manifested in the saddle-node, transcritical, pitchfork, and 
Poincare- Andronov-Hopf bifurcations of vector fields that we have already 
studied. 

Example 20.3.1 (The Saddle-Node Bifurcation). Consider the one-parameter 
family of one-dimensional vector fields 

x = f{x,p), 2 /GR^, (20.3.1) 



with 



/( 0 , 0 ) = 0 , 



(20.3.2) 
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|(0,0) = 0, 

We saw in Section 20.1c that the conditions 


(20.3.3) 


|(0.0)#0, 


(20.3.4) 


J(o,o)/o, 


(20.3.5) 



were sufficient conditions in order for the vector field (20.3.1) to undergo a saddle- 
node bifurcation at /r = 0. The question we ask is the following. 

If a one-parameter family of one-dimensional vector fields satisfying 
(20.3.2), (20.3.3), (20.3.4), and (20.3.5) is “perturbed,” will the re- 
sulting family of one-dimensional vector helds have qualitatively the 
same dynamics? 

We will have essentially answered this question once we have explained what we 
mean by the term “perturbed.” 

We do this by hrst eliminating the parameters entirely. Consider a one- 
dimensional vector field 

X = f{x) = aox^ + 0{x^), (20.3.6) 

where, in the Taylor expansion of f{x), we have omitted the constant and 0(x) 
terms, since we want (20.3.6) to have a nonhyperbolic hxed point at x = 0. 
Because x = 0 is a nonhyperbolic fixed point, the orbit structure near x = 0 of 
vector fields near (20.3.6) may be very different. We consider vector fields close 
to (20.3.6) by embedding (20.3.6) in a one-parameter family of vector fields as 
follows 

X = f{x, fi) = fi + aox^ + 0{x^). (20.3.7) 

The addition of the term “/i” in (20.3.7) can be viewed as a perturbation of 

(20.3.6) via adding lower-order terms in the Taylor expansion of the vector field 
about the nonhyperbolic fixed point (note: “lower-order terms” means terms of 
order lower than the first nonvanishing term in the Taylor expansion). Clearly, 

(20.3.7) satisfies (20.3.2), (20.3.3), (20.3.4), and (20.3.5); hence, (x,/r) = (0,0) is 
a saddle-node bifurcation point. What about further perturbations of (20.3.7)? 
If we add terms of 0{x^) and larger, we see that this has no effect on the nature 
of the bifurcation, since the saddle-node bifurcation is completely determined by 
(20.3.2), (20.3.3), (20.3.4), and (20.3.5), i.e., by terms of 0{x^) and lower. We 
could perturb (20.3.7) further by adding lower-order terms. For example, 

X = /(x, jj,,e) = jj, + ex + aox^ 4- 0{x^). (20.3.8) 

In this case we have a two-parameter family of one-dimensional vector fields hav- 
ing a nonhyperbolic fixed point at (x, /r, e) = (0, 0, 0). However, the nature of the 
saddle-node bifurcation (i.e., the geometry of the curve(s) of hxed points pass- 
ing through the bifurcation point) is completely determined by (20.3.2), (20.3.3), 
(20.3.4), and (20.3.5). Hence, the addition of the term “ex” in (20.3.8) does not 
introduce any new dynamical phenomena into (20.3.7) (provided /r yf 0). 



End of Example 20.3.1 
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Example 20.3.2 (The Transcritical Bifurcation). Consider the one-parameter 
family of one-dimensional vector fields 



x = /(x,/i), xGR^, /i G R^, 


(20.3.9) 


with 




/(0,0) = 0, 


(20.3.10) 


g(0,0).0. 


(20.3.11) 


We saw in Section 20. Id that if (20.3.9) also satishes 






(20.3.12) 




(20.3.13) 


f 

5j((o.o)/o, 


(20.3.14) 



then a transcritical bifurcation occurs at (*, /r) = (0, 0). The conditions (20.3.12), 
(20.3.13), and (20.3.14) imply that, in the study of the orbit structure near the 
bifurcation point, terms of 0{x^) and larger in the Taylor expansion of the vector 
held about the bifurcation point do not qualitatively affect the nature of the 
bifurcation (i.e., they do not affect the geometry of the curves of hxed points 
passing through the bifurcation point). From this we concluded that a normal 
form for the transcritical bifurcation was given by 

i = /ra; =p a:^. (20.3.15) 

Now let us consider a perturbation of the transcritical bifurcation by perturbing 
this normal form. Following our discussion of the perturbation of the saddle- 
node bifurcation and upon examining the dehning conditions for the transcritical 
bifurcation given in (20.3.12), (20.3.13), and (20.3.14), we see that the only way 
to perturb (20.1.31) that may lead to qualitatively new dynamics is as follows 

x = e + nx^x^. (20.3.16) 

In Figure 20.3.1 we show what becomes of the transcritical bifurcation for 
e < 0, e = 0, and e > 0. 

From this we see that the two curves of fixed points which pass through the 
origin for e = 0 break apart into either a pair of curves of hxed points on which 
no bifurcation happens on passing through /r = 0 or a pair of saddle-node bifur- 
cations. 



End of Example 20.3.2 



Example 20.3.3 (The Pitchfork Bifurcation), 
for the pitchfork bifurcation was found to be 

i = /ix =F X € R 



From (20.1.83), the normal form 






1 



1 



(20.3.17) 
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e>o 



X 




X 




e<o 



FIGURE 20.3.1. 

Using arguments exactly like those used in Examples 20.3.1 and 20.3.2, we can 
see that the only perturbations able to affect the orbit structure near — 0 of 
(20.3.17) are 

i = e + /rx =F a:®, e G (20.3.18) 

In Figure 20.3.2 we show bifurcation diagrams for e < 0, e = 0, and e > 0. 

As in the case of transcritical bifurcation, we see that, upon perturbation, the 
two curves of fixed points which pass through {x, fi) = (0, 0) for e = 0 break up 
into either curves of fixed points exhibiting no bifurcation as jj, varies through 0 
or saddle-node bifurcations for e 0. 



End of Example 20.3.3 



Example 20.3.4. Recall the one-parameter family of one-dimensional vector 
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fields discnssed in Example 20.1.4 

x = fjL — x^, (20.3.19) 

The vector fields in this example have a nonhyperbolic fixed point at {x, jx) = 
(0, 0), but the orbit structure is qualitatively the same for all jx, i.e., no bifurcation 
occurs at (x,jx) = (0,0). 



X 




X 




X 




FIGURE 20.3.2. 

Now consider the following perturbation of (20.3.19) 

x = fx + ex — x^, (20.3.20) 



From Figure 20.3.2 (with the roles of e and fx reversed), it should be evident that 
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(20.3.20) does exhibit saddle-node bifurcations for e ^ 0. 



End of Example 20.3.4 

Example 20.3.5 (The Poincare- Andronov-Hopf Bifurcation). From Theorem 
20.2.3, the normal form for the Poincare-Andronov-Hopf bifurcation was given 
by 

r = jj,dr + ar^, (r, 0) € x , /r € (20.3.21) 

e = uJ + c^JL + br'^. (20.3.22) 

We want to consider how the bifurcation near (r, /r) = (0, 0) studied in Section 
20.2 changes as (20.3.21) is perturbed. Three points should be considered. 

1. Theorem 20.2.3 tells us that, for a 7^ 0, d 7! 0, higher order terms (i.e., 

will not affect the dynamics of (20.3.21) near (r, /i) = (0,0). 

2. Since cu is a constant, for (r, fi) small, in order to determine the nature 
of solutions bifurcating from the origin we need only worry about the r 
component of the vector field (20.3.21). 

3. Due to the structure of the linear part of the vector held, r = 0 is a hxed 
point for /r sufficiently small and no terms of even order in r are present 
in the r component of the normal form. 

Using these three points, we see that, for a 7! 0, d 7^^ 0, no perturbations allowed 
by the structure of the vector held (cf. the third point above) will qualitatively 
alter the nature of the bifurcation near (r, /i) = (0, 0). 



End of Example 20.3.5 



From Examples 20.3.1-20.3.5 we might conclude that, in one-parameter 
families of vector fields, the most “typical” bifurcations are saddle-node 
and Poincare-Andronov-Hopf. This is indeed the case, as we will show in 
Section 20.4. Moreover, these examples show that some nonhyperbolic fixed 
points are more degenerate than others in the sense that more parameters 
are needed in order to capture all possible nearby behavior. In Section 20.4 
we explore the idea of the codimension of a bifurcation, which will enable 
us to quantify these ideas. A complete theory is given by Golubitsky and 
Schaeffer [1985] and Golubitsky, Stewart, and Schaeffer [1988]. 



20.4 The Idea of the Codimension of a Bifurcation 

We have seen that some types of bifurcations (e.g., transcritical, pitchfork) 
are more degenerate than others (e.g., saddle-node). In this section we will 
attempt to make this more precise by introducing the idea of the codimen- 
sion of a bifurcation. We will do this by starting with a heuristic discussion 
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of the “big picture” of bifurcation theory. This will serve to show just how 
little is actually understood about bifurcation theory at this stage of the 
mathematical development of nonlinear dynamical systems theory. 



20.4a The “Big Picture” for Bifurcation Theory 

The first step is to eliminate the consideration of all parameters from the 
problem; instead, we consider the infinite-dimensional space of all dynami- 
cal systems, either vector fields or maps. Within this space we consider the 
subset of all structurally stable dynamical systems, which we denote as S. 
By the definition of structural stability (cf. Chapter 12), perturbations of 
structurally stable dynamical systems do not yield qualitatively new dy- 
namical phenomena. Thus, from the point of view of bifurcation theory, it 
is not dynamical systems in S that are of interest but rather dynamical sys- 
tems in the complement of S, denoted S'°, since perturbations of dynamical 
systems in can result in systems exhibiting radically different dynamical 
behavior. Thus, in order to understand the types of bifurcations that may 
occur in a class of dynamical systems, it is necessary to understand the 
structure of S‘^. 

Presumably, in order for a dynamical system to be in 5''^, the system must 
satisfy a certain number of extra conditions or constraints. When viewed 
geometrically in the infinite-dimensional function space setting, this can be 
interpreted as implying that S‘^ is a lower-dimensional “surface” contained 
in the space of dynamical systems. Here we use the word “surface” in a 
heuristic sense. More specifically, it would be nice if we could show that S‘^ 
is a codimension one submanifold. In practice, however, S‘^ may have sin- 
gular regions and therefore be more appropriately described as an algebraic 
variety (see Arnold [1983]). In any case, for our heuristic discussion, it does 
no harm for the reader to visualize as a surface. Before proceeding with 
this picture, let us first make a slight digression and define the notion of 
the “codimension of a submanifold.” 

The Codimension of a Submanifold 

Let M be an m-dimensional manifold and let N be an n-dimensional sub- 
manifold contained in M; then the codimension of N is defined to be m — n. 
Equivalently, in a coordinate setting, the codimension of N is the number 
of independent equations needed to define N . Thus, the codimension of 
a submanifold is a measure of the avoidability of the submanifold as one 
moves about the ambient space; in particular, the codimension of a sub- 
manifold N is equal to the minimum dimension of a submanifold P G M 
that intersects N such that the intersection is transversal. We have defined 
codimension in a finite-dimensional setting, which permits some intuition 
to be gained; now we move to the infinite-dimensional setting. Let M be 
an infinite-dimensional manifold and let IV be a submanifold contained in 
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M. (Note: for the definition of an infinite-dimensional manifold see Hirsch 
[1976]. Roughly speaking, an infinite-dimensional manifold is a set which 
is locally diffeomorphic to an infinite-dimensional Banach space. Because 
infinite-dimensional manifolds are discussed in this section only, and then 
mainly in a heuristic fashion, we refer the reader to the literature for the 
proper definitions.) We say that N is of codimension k if every point of 
N is contained in some open set in M which is diffeomorphic to C/ x R*, 
where U is an open set in N . This implies that k is the smallest dimension 
of a submanifold P C M that intersects N such that the intersection is 
transversal. Thus, the definition of codimension in the infinite-dimensional 
case has the same geometrical connotations as in the finite-dimensional 
case. Now we return to our main discussion. (For the case of “codimension 
oo” , in this definition is replaced with an infinite dimensional Banach 
space.) 

Suppose 5'° is a codimension one submanifold or, more generally, an 
algebraic variety. We might think of 5“ as a surface dividing the infinite- 
dimensional space of dynamical systems as depicted in Figure 20.4.1. Bi- 
furcations (i.e., topologically distinct orbit structures) occur as one passes 
through 5“. Thus, in the infinite-dimensional space of dynamical systems, 
one might define a bifurcation point as being any dynamical system which 
is structurally unstable. 




FIGURE 20.4.1. 

In this setting one might initially conclude that bifurcations seldom oc- 
cur and are unimportant, since any point p on may be perturbed to S 
by (most) arbitrarily small perturbations. One might also conclude from 
a practical point of view that dynamical systems contained in might 
not be very good models for physical systems, since any model is only an 
approximation to reality and therefore should be structurally stable. How- 
ever, suppose we have a curve 7 of dynamical systems transverse to S''^, i.e., 
a one-parameter family of dynamical systems. Then any sufficiently small 
perturbation of this curve 7 still results in a curve 7 ' transverse to 5'°. Thus, 
although any particular point on S‘^ may be removed from by (most) 
arbitrarily small perturbations, a curve transverse to S‘^ remains transverse 
to S'^ under perturbation. Bifurcation may therefore be unavoidable in a 
parameterized family of dynamical systems. This is an important point. 
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Now, even if we are able to show that S‘^ is a codimension one submani- 
fold or algebraic variety, S‘^ itself may be divided up into objects of higher 
codimension corresponding to more degenerate types of bifurcations. A par- 
ticular type of codimension k bifurcation in S‘^ would then be persistent in 
a fc-parameter family of dynamical systems transverse to the codimension 
k submanifold. 

This is essentially the program for bifurcation theory originally outlined 
by Poincare. In order to utilize it in practice one would proceed as follows. 

1. Given a specific dynamical system, determine whether or not it is 
structurally stable. 

2. If it is not structurally stable, compute the codimension of the bifur- 
cation. 

3. Embed the system in a parameterized family of systems transverse 
to the bifurcation surface with the number of parameters equal to 
the codimension of the bifurcation. These parameterized systems are 
called unfoldings or deformations and, if they contain all possible 
qualitative dynamics that can occur near the bifurcation, they are 
called universal unfoldings or versal deformations; see Arnold [1983] 
and our discussion to follow. 

4. Study the dynamics of the parametrized systems. 

In this way one obtains structurally stable families of systems. More- 
over, this provides a method for gaining a complete understanding of the 
qualitative dynamics of the space of dynamical systems with as little work 
as possible; namely, one uses the degenerate bifurcation points as “orga- 
nizing centers” around which one studies the dynamics. Because elsewhere 
the dynamical systems are structurally stable, there is no need to worry 
about the details of their dynamics; qualitatively, they will be topologically 
conjugate to the structurally stable dynamical systems in a neighborhood 
of the bifurcation point. 

This program is far from complete, and many of the problems associ- 
ated with its completion are exactly those encountered in our discussion of 
structural stability in Chapter 12. First, we must specify what we mean by 
the “infinite-dimensional space of dynamical systems.” (Usually this does 
not present any major difficulties.) Next, we must equip the space with a 
topology in order to define what we mean by a perturbation of a dynamical 
system. We have already seen (cf. Example 12.0.1 of Chapter 12) that there 
can be problems with this if the phase space is unbounded; nevertheless, 
these difficulties can usually be brought under control. The real difficulty 
is the following. Given a dynamical system, what does one need to know 
about it in order to determine whether it is in S or S'°? For vector fields on 
compact, boundaryless two-dimensional manifolds Peixoto’s theorem gives 




396 



20. Bifurcation of Fixed Points of Vector Fields 



us an answer to this question (see Theorem 12.1.4 , Chapter 12), but, in 
higher dimensions, we do not have nice analogs of this theorem. Moreover, 
the detailed structure of 5'° is certainly beyond our reach at this time. Al- 
though the situation appears hopeless, some progress has been made along 
two fronts: 

1. Local bifurcations; 

2. Global bifurcations of specific orbits. 

Since the subject of this chapter is local bifurcations we will discuss only 
this aspect. In Chapter 33 we will see examples of global bifurcations; for 
more information see Wiggins [1988]. 

Local bifurcation theory is concerned with the bifurcation of fixed points 
of vector fields and maps or with situations in which the problem can be 
cast into this form, such as in the study of bifurcations of periodic motions. 
For vector fields one can construct a local Poincare map (see Chapter 10) 
near the periodic orbit, thus reducing the problem to one of studying the 
bifurcation of a fixed point of a map, and for maps with a k periodic orbit 
one can consider the iterate of the map, thus reducing the problem 
to one of studying the bifurcation of a fixed point of the iterate of 
the map. Utilizing a procedure such as the center manifold theorem or the 
Lyapunov-Schmidt reduction (see Chow and Hale [1982]), one can usually 
reduce the problem to that of studying an equation of the form 

/(a:,A) = 0, (20.4.1) 

where x G M", A G are the system parameters and / : K" x — >■ K" is 

assumed to be sufficiently smooth. The goal is to study the nature of the 
solutions of (20.4.1) as A varies. In particular, it would be interesting to 
know for what parameter values solutions disappear or are created. These 
particular parameters are called bifurcation values, and there exists an ex- 
tensive mathematical machinery called singularity theory (see Golubitsky 
and Guillemin [1973]) that deals with such questions. Singularity theory is 
concerned with the local properties of smooth functions near a zero of the 
function. It provides a classification of the various cases based on codimen- 
sion in a spirit similar to that described in the beginning of the section. 
The reason this is possible is that the codimension k submanifolds in the 
space of all smooth functions having zeroes can be described algebraically 
by imposing conditions on derivatives of the functions. This gives us a 
way of classifying the various possible bifurcations and of computing the 
proper unfoldings or deformations. From this one might be led to believe 
that local bifurcation theory is a well-understood subject; however, this is 
not the case. The problem arises in the study of degenerate local bifurca- 
tions, specifically, in codimension k {k > 2) bifurcations of vector fields. 
Fundamental work of Takens [1974], Langford [1979], and Guckenheimer 
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[1981] has shown that, arbitrarily near these degenerate bifurcation points, 
complicated global dynamical phenomena such as invariant tori and Smale 
horseshoes may arise. These phenomena cannot be described or detected 
via singularity theory techniques. Nevertheless, when one reads or hears 
the phrase “codimension k bifurcation” in the context of bifurcations of 
fixed points of dynamical systems, it is the singularity theory recipe that 
is used to compute the codimension. For this reason we want to describe 
the singularity theory approach. 



20.4b The Approach to Local Bifurcation Theory: 
Ideas and Results from Singularity Theory 

We now want to give a brief account of the techniques from singularity 
theory that are used to determine the codimension of a local bifurcation 
and the correspondingly appropriate unfolding or versal deformation. Our 
discussion follows closely Arnold [1983]. 

We begin by specifying the infinite-dimensional space of dynamical sys- 
tems of interest. This will be the set of C” maps of M" into K™, denoted 
C”(K", M™). At this stage we can think of the elements of C'’(R”,K'") 
as either vector fields or maps; we will draw a distinction only when re- 
quired by context. Several technical issues involving C”(K",]R"‘) must now 
be addressed. 

1. Since we are interested only in local behavior, our maps need not be 
defined on all of K", but rather they need only be defined on open 
sets containing the region of interest (i.e., the fixed point). Along 
these same lines, we can be more general and consider maps of C°° 
manifolds. However, since we are concerned with local questions this 
will not be an issue; the reader should consult Arnold [1983]. 

2. As mentioned in our discussion of structural stability in Chapter 12, 
there can be technical difficulties in deciding when two dynamical 
systems are “close” when the phase space is unbounded (as is R"). 
Since we are interested only in the behavior near fixed points, we will 
be able to avoid this unpleasant issue. 

3. As we have mentioned several times, we will be interested in the orbit 
structure of dynamical systems in a sufficiently small neighborhood of 
a fixed point. This phrase is a bit ambiguous, so at this point, we want 
to spend a little effort to try to make it clearer. 

We begin by studying an example which will illustrate some of the salient 
points. Consider the vector field 

x = fi — x^+sx^, a; e ^ S (20.4.2) 




398 



20. Bifurcation of Fixed Points of Vector Fields 



where we view the term in (20.4.2) as a perturbation term. It should 

be evident (see Section 20.1c) that (20.4.2) undergoes a saddle-node bifur- 
cation at (x, jj) = (0, 0) so that, in the x — /i plane, in a sufficiently small 
neighborhood of the origin, the curve of fixed points of (20.4.2) appear as 
in Figure 20.4.2. 




FIGURE 20.4.2. 

However, (20.4.2) is so simple that we can actually compute the global 
curve of fixed points, which is given by 

fj, = — ex^ (20.4.3) 

and is shown in Figure 20.4.2. Thus, we see that, besides the saddle-node 
bifurcation point at (x,/x) = (0,0), (20.4.2) has an additional saddle-node 
bifurcation point at (x,/x) = (2/3e, 4/27e^). Clearly these two saddle-node 
bifurcation points are far apart for e small; however, this example shows 
that the size of a “sufficiently small neighborhood of a point” can vary 
from situation to situation. In this example, the “sufficiently small neigh- 
borhood” of {x,fi) = (0,0) shrinks to a point as e — >■ oo. The idea of a 
“germ” of a differentiable function has been invented in order to handle 
this ambiguity, and we refer the interested reader to Arnold [1983] for an 
introduction to this formalism. However, in this book we will not utilize 
the idea of germs but rather the less mathematically precise and more ver- 
bose approach of reminding the reader that we are always working in a 
sufficiently small neighborhood of a fixed point. 

Now that we have specified the infinite-dimensional space of dynamical 
systems, next we need some way of understanding the geometry of the 
“degenerate” dynamical systems. This is afforded by the notions of jets 
and jet spaces. We will first give some definitions and then explanations. 



Definition 20.4.1 (fc-jet of a Map) Consider f G C'’(K",K'"). The k- 
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jet of / at a; (k < r) is given by the following {k + 2)-tuple 






Thus we see that the /c-jet of a map at a point is simply the Taylor 
coefficients through order k plus the point at which they are evaluated. We 
denote the fc-jet of / at a; by 

Although it may seem a bit silly to dress up something as commonplace as 
a Taylor expansion in a new formalism, the reader will see that there is a 
definite payoff. We next consider the set of all /c-jets of C’’ {k < r) maps of 
M” into at all points of M”. 



Definition 20.4.2 (The Space of fc-jets) The set of all k-jets of C’’ 
(k < r) maps of K" into K"* at all points of K” is called the space of 
fc-jets and is denoted by 

J^(M",]R™) = {The space of k-jets ofC°°maps oJW^intoR"^} . 



The spaces have a nice linear vector space structure. In fact, 

we can identify J^(K",1R™) with for an appropriate choice of p. We 
illustrate this in the following examples. 

Example 20.4.1. = R" x R"*. 



End of Example 20.4.1 



Example 20.4.2. J^(R^,R^)is three dimensional, because points in J^(R^, R^) 

can be assigned the coordinates 

(*,/(x),g(a.)). 



End of Example 20.4.2 

Example 20.4.3. J^(R^,R^) is eight dimensional, because points in J^(R^,R^) 

can be assigned the coordinates 



{x,f{x),Df{x)), 



where Df{x) is a 2 x 2 matrix. 



End of Example 20.4.3 
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In a certain sense, the spaces can be thought of as finite- 

dimensional approximations of 

We now introduce a map that will play an important role when we discuss 
the notion of versal deformations. 

Definition 20.4.3 (The /c-jet Extension of a Map) For any map f G 

C’’(K”,K'") we define a map 

/ : K” ^ 

a; ^ Jxif) = fix), 

which we call the fc-jet extension of / (k < r). 

Thus, the fc-jet extension of / merely associates to each point in the 
phase space of the dynamical system (/) the k-jet of / at the point. We 
also remark that the fc-jet extension of / can be viewed as a map of the 
phase space of the dynamical system (/) into J^(K", K™). This remark will 
be important later on. We next introduce a new notion of transversality. 

Recall from Chapter 12 the notion of two manifolds being transversal. 
We now introduce a similar idea: the notion of a map being transverse to 
a manifold. 

Definition 20.4.4 (Transversality of a Map to a Submanifold) 

Consider a map f G C’’(M”, K™) and a submanifold M C K™. The map is 
said to be transversal to M at a point x G if either f{x) ^ M or the 
tangent space to M at f{x) and the image of the tangent space to M" at x 
under Df{x) are transversal, i.e., 

Df(x) ■ + Ty(,)M = 

The map is said to be transversal to M if it is transversal to M at any 
point X G K”. 

Let us consider an example. 

Example 20.4.4. Consider the map 

f :M} ^ 

X [x, X^). 

When viewed geometrically, the image of under / is only the parabola y = x^\ 
see Figure 20.4.3. We ask the following questions. 

1. Thinking of the x-axis as a one-dimensional submanifold of is / trans- 
verse to the x-axis? 

2. Similarly for the y-axis, is / transverse to the j/-axis? 
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y 




FIGURE 20.4.3. 

The only point at which the image of under / intersects either the a:-axis 
or j/-axis is at the origin; therefore, this is the only point at which we need to 
check transversality. Let us denote the a:-axis by X and the j/-axis by Y. Then, 
using the standard coordinates on R^, we can take 

T(o,o)X = (R\0), (20.4.4) 

T(o,o)T = (0,R^). (20.4.5) 

We have that Df{0) — (1,0); thus, on recalling Definition 20.4.4 and examining 
(20.4.4) and (20.4.5), it follows that / is transverse to Y but not transverse to 

X. 



End of Example 20.4.4 



We now state the Thom transversality theorem, which will guide the 
construction of candidates for versal deformations of degenerate dynamical 
systems. 

Theorem 20.4.5 (Thom) Let U C K", V C K™ be open sets and C be a 
submanifold of J^{U, V). The set of maps f : U ^ V whose k-jet extensions 
are transversal to C is an everywhere dense countable intersection of open 
sets in (y{U,V), where r > ro(/c,n,m) (where tq is some function of the 
indicated variables). 

Proof: See Arnold [1983]. □ 

This theorem allows us to work in finite dimensions, where geometrical 
intuition is more apparent, and to draw conclusions concerning the geome- 
try in the infinite-dimensional space of dynamical systems, C^(U, V), where 
U C K", V C K™ are open sets . At this point we have developed enough 
machinery from singularity theory to discuss the idea of the codimension 
of a local bifurcation. We now turn to this question. 
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20.4c The Codimension of a Local Bifurcation 

As mentioned in our discussion of the “big picture” of bifurcation theory, 
the problem we are immediately faced with is determining whether a dy- 
namical system is structurally stable. If we limit ourselves to the study of 
the behavior of fixed points, then the problem is much easier, because we 
know how to characterize structurally unstable fixed points — they are sim- 
ply the nonhyperbolic fixed points. The techniques from singularity theory 
we developed in 20.4 will enable us to characterize the degree of “structural 
instability” of a fixed point in a way that is similar in spirit to our discussion 
of the “big picture” of bifurcation theory given in that section. However, 
the reader should realize that these ideas are concerned with fixed point 
behavior only and as such we must often do additional work to determine 
whether nearby dynamical phenomena are properly taken into account. We 
will see several examples of this as we go along. 

Thus far we have been using the term “dynamical system” to refer to 
vector fields and maps interchangeably. At this stage we need to draw 
a slight distinction. We will be studying the fixed points of dynamical 
systems. For vector fields 



x = f{x), xGM", (20.4.6) 

this means studying the equation 

fix) = 0, (20.4.7) 

and for maps 

X !->■ g{x), X G K", (20.4.8) 

this means studying the equation 

g{x)-x = 0. (20.4.9) 

It should be clear that from an analytic point of view, (20.4.7) and (20.4.9) 
are essentially the same. However, for the remainder of this section we will 
deal solely with vector fields and leave the trivial modifications for maps 
as exercises. 

For vector fields the space of dynamical systems will be C”(C/, U) and 
the associated jet spaces J^{U, V), where C/ C K", U C are open sets. 
In particular, we will work solely in the finite-dimensional space J^{U, V). 
Within J^{U, V) we will be interested in the subset consisting of fc-jets of 
vector fields having fixed points. We denote this subset by F (for fixed 
points) and note that F has codimension n. Within F we will be interested 
in the fc-jets of vector fields having nonhyperbolic fixed points. We denote 
this subset by B (for bifurcation) and note the B has codimension n+ 1. 
We remark that the codimension of F and B is independent of k; however, 
the dimension of F and B is not independent of k. Let us now consider 
some specific examples. 
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Example 20.4.5 (One-Dimensional Vector Fields). As our space of vector fields 
we take R^). In this case is (fe-|-2)-dimensional with coordinates 

of points in J*^(R",R") given by the {k + 2)-tuples 



df d'^f 



X € 



/ec^ 



E is (fc -I- l)-dimensional (codimension 1) with coordinates of points in F given 

by 






X € . 






and B is fc-dimensional (codimension 2) with coordinates of points in B given by 



f 




xgr\ / g c’"(r\r^). 



End of Example 20.4.5 

Example 20.4.6 (n-Dimensional Vector Fields). As our space of vector fields 
we take C’'(R",R"). In this case coordinates of J*^(R",R") are given by 

[x,fix),Df{x),---,D'^f{x)), xGR", /€ C’-(R",R"). 

F is codimension n, with points in F having coordinates 

(a;,0,D/(®),...,DV(®)) , * G R", /gC"(R",R"), 

and B is codimension n -f 1, with points in B having coordinates 

(o;,0,57^),---,dV(®)) , xGR", /GC"(R",R"), 

where Df{x) represents a nonhyperbolic matrix which lies on a surface of codi- 
mension 1 in the n^-dimensional space of n x n matrices. This is explained in 
great detail in Section 20.5. 



End of Example 20.4.6 

We are now at the point where we can define the codimension of a fixed 
point. There are two possibilities for the choice of this number, and we 
closely follow the discussion of Arnold [1972]. 

Definition 20.4.6 (The Codimension of a Fixed Point) Consider 
(K",R") and the subset o/ consisting of k-jets of elements of 
C’'(R",M") that have fixed points. We denote this subset by F and note 
that F has codimension n in J^(R”,M”). Consider the k-jet of an element 
o/C'’(R”,R”) that has a nonhyperbolic fixed point. Then this k-jet lies in 
a subset of F defined by conditions on the derivatives. Suppose this subset 
of F has codimension b in J^(R",R"). Then we define the codimension of 
the fixed point to be b — n. 
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Before giving examples we want to make a few general remarks concern- 
ing this definition. 

Remark 1. Evidently k must be taken sufficiently large so that the degree 
of degeneracy of the nonhyperbolic fixed point can be specified. 

Remark 2. This definition says that the codimension of a fixed point is equal 
to the codimension of the subset of F specified by the degeneracy of the 
fixed point minus n. Thus, hyperbolic fixed points have codimension zero by 
this definition. This seems reasonable, since the notion of the codimension 
of a fixed point should somehow specify the degree of “nongenericity” of 
the fixed point. 

Remark 3. There is another way of describing the reasons why we chose this 
way of defining the codimension of a fixed point. The dynamical system in- 
duces a map of the phase space, K", into J*(K",]R"). This map is only the 
fc-jet extension map (cf. Definition 20.4.3). At hyperbolic fixed points this 
map is transverse to the subset F of Thus, hyperbolic fixed 

points cannot be destroyed by small perturbations. Hence, if the notion of 
codimension is to quantify the amount of “degeneracy” of nonhyperbolic 
fixed points, then the generic elements of F should be regarded as codi- 
mension zero. Practically speaking, this also implies that generically the 
fixed points move if they are perturbed. 

Now we consider several examples where we will compute the codimen- 
sion. 

Example 20.4.7. Consider the vector field 

X = f{x) = ax^ + 0{x^), (20.4.10) 

We are interested in studying (20.4.10) near the nonhyperbolic fixed point x = 0. 
At this point we see that the fc-jet of (20.4.10) is a typical element of the set 
B described in Example 20.4.5. Hence, using Definition 20.4.6, we conclude that 
a: = 0 is codimension 1. 



End of Example 20.4.7 



Example 20.4.8. Consider the vector field 

X = f{x) = ax^ + 0{x*), a; G (20.4.11) 

We are interested in (20.4.11) near the nonhyperbolic fixed point x = 0. As 
in Example 20.4.7, the fe-jet of (20.4.11) is contained in the set B described in 
Example 20.4.5. However, (20.4.11) is more degenerate than the typical element 
of B in that (20.4.11) also has 
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Hence, the fc-jet of (20.4.11) lies in a lower dimensional subset of B, denoted B' , 
which has codimension 3 in with points in B' having coordinates 

(^*,0,0,0, /eC7R",K"). 

Using Definition 20.4.6 we can thus conclude that a: = 0 is codimension 2. 



End of Example 20.4.8 

Example 20.4.9. In Section 20.5 we show the following. 

1. In the four-dimensional space of 2 x 2 real matrices the matrices 

(! 7 ) 

and 

(» ;) (20,4.13) 

lie on surfaces of codimension 2. 

2. In the nine-dimensional space of 3 x 3 real matrices the matrix 

/ 0 -w 0 \ 

w 0 0 (20.4.14) 

\ 0 0 0 / 

lies on a surface of codimension 3. 

Hence, using Example 20.4.6 and Definition 20.4.6, we might conclude that the 
codimension of a fixed point of a vector field whose 1-jet in (real) Jordan canonical 
form is given by (20.4.12), (20.4.13), or (20.4.14) is 2, 2, and 3, respectively. 
However, this is not quite true. By a re-parametrization of time, we can show 
that cases (20.4.12), and (20.4.14) are codimension 1 and 2, respectively. This 
is another way in which dynamics enters to slightly cloud the singularity theory 
approach to classifying the degeneracy of an equilibrium point of a vector field. 



End of Example 20.4.9 



We end this section with two remarks. 

Remark 1. The codimensions computed in these examples are for generic 
vector fields. In particular, we have not considered the possibility of symme- 
tries which would put extra constraints on eigenvalues and of derivatives 
which would result in a modification of the codimension; see Golubitsky 
and Schaeffer [1985] and Golubitsky, Stewart, and Schaeffer [1988]. 

Remark 2. We have referred to the sets F and B as “subsets” of J*(R”,1R”). 
In applying the Thom transversality theorem the question of whether or 
not they are actually submanifolds arises. The same question is also of 
interest for the higher codimension subsets of B corresponding to more 
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degenerate fixed points. In general, these subsets may have singular points 
and, thus, they will not have the structure of a submanifold. However, 
these singularities can be removed by slight perturbations and, hence, for 
our purposes, they can be treated as submanifolds. This technical point is 
treated in great detail in Gibson [1979]. 



20. 4d Construction of Versal Deformations 

We now want to develop the necessary definitions in order to discuss versal 
deformations of vector fields. We follow Arnold [1972], [1983] very closely. 
We remark the the phrase “unfolding” is often used to refer to a simi- 
lar procedure; see, e.g., Guckenheimer and Holmes [1983] (where the term 
“universal unfolding” is taken to be virtually synonymous with the phrase 
“versal deformation”) or Golubitsky and Schaeffer [1985] (where various 
subtleties in the definitions are thoroughly explored). 

Gonsider the following C”, parameter-dependent vector fields 

x = f{x,X), xGR", AgR^ (20.4.15) 

y = g{y,X), y G K”, A G R^. (20.4.16) 

We will be concerned with local behavior of these vector fields near fixed 
points. Therefore, we assume that (20.4.15) and (20.4.16) have fixed points 
at (a;o,Ao) and (t/OjAo), respectively, and we will be concerned with the 
dynamics in a sufficiently small neighborhood of these points. We now give 
a parametric version of the notion of C®-equivalence given in Definition 
19.12.1. 

Definition 20.4.7 (Parametric Version of C°-Equivalence) Equa- 
tions (20.4-15) and (20. 4-16) are said to be -equivalent (or topologically 
equivalent) if there exists a continuous map 

h:U^V, 

with U a neighborhood of {xq,Xo) and V a neighborhood of yo such that, 
for X sufficiently close to Aq, 

M-,A), 

with h{xo, Ao) = yo, is a homeomorphism that takes orbits of the flow gener- 
ated by (20.4.15) onto orbits of the flow generated by (20.4-16), preserving 
orientation but not necessarily parameterization by time. If h does preserve 
parameterization by time, then (20-4-15) and (20-4-16) are said to be C° 
conjugate (or topologically conjugate). 

In the construction of versal deformations we will be concerned with 
having the minimum number of parameters (for reasons to be discussed 
later), not too many or too few. The following definition is the first step 
toward formalizing this notion. 
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Consider the following C’’ vector field 

x = u{x,^I), xGK", (20.4.17) 



with 



u{xo,fJ.o) = 0 . 



Then we have the following definition. 



Definition 20.4.8 (Family Induced from a Deformation) Let A = 

be a continuous map defined for fj, sufficiently close to /xq with Aq = 
0(/xq). We say that (20.4.17) is induced from (20.4.15) if 



u{x,p.) = f{x,<j){fi)). 



We now can give our main definition. 

Definition 20.4.9 (Versal Deformation) Equation (20.4.15) is called 
a C° -equivalent versal deformation (or just versal deformation) of 

x = f{x,Xo) (20.4.18) 

at the point xg if every other parameterized family of C’’ vector fields that 
reduces to (20.4.18) for a particular choice of parameters is equivalent to 
a family of vector fields induced from (20.4-15). 

At this point we again remind the reader that we are working in a suf- 
ficiently small neighborhood of (xq, Aq). We are now at the stage where we 
can construct versal deformations of dynamical systems. We will deal with 
vector fields and merely state the simple modifications needed for dealing 
with maps. 

Once we have a vector field having a nonhyperbolic fixed point there are 
four steps necessary in order to construct a versal deformation. 

Steps in the Construction of a Versal Deformation 

Step 1 . Put the vector field in normal form to reduce the number of cases 
that need to be considered. 

Step 2. Truncate the normal form and embed the resulting fc-jet of the 
normal form in a parameterized family having the number of pa- 
rameters equal to the codimension of the bifurcation such that the 
parameterized family of /c-jets is transverse to the appropriate subset 
of degenerate /c-jets. 

Step 3. Appeal to the Thom transversality theorem (Theorem 20.4.5) to 
argue that in this way one has constructed a generic family. By the 
term generic family we mean a family from an everywhere dense set 
in the space of all families. 
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Step 4- Prove that the parametrized family constructed in this manner is 
actually a versal deformation. 

It should be apparent that Step 4 is by far the most difficult. Steps 1 
through 3 merely give us a procedure for constructing a parameterized 
family that we hope will be a versal deformation. The reason why it may 
not be is related to dynamics. The whole procedure is static only in nature; 
it takes into account only the nature of the fixed point. For one-dimensional 
vector fields we will see that the method will yield versal deformations, since 
the only possible orbits distinguished by C° equivalence are fixed points. 
However, for higher dimensional vector fields we will see that the method 
may not yield versal deformations and, indeed, no such versal deformation 
may exist. 

Elimination of a Parameter in One- Dimensional Vector Fields. Before 
constructing candidates for versal deformations of vector fields we men- 
tion a technical point relevant to one dimensional vector fields that 
enables us to eliminate a parameter in some circumstances. Consider an 
n-parameter family of one-dimensional vector fields of the form 

f{x, y) = P 2 X -V y^x^ ^ -I- y„x"~^ ± ax", (20.4.19) 

where 

a: G 07 ^ 0 , y = {yi, . . . , y„) G R" . 

Let us suppose that we make the following coordinate shift: 

a; 1 -^ a; -I- c, 

with c to be chosen shortly. Then (20.4.19) can be rewritten as 

f{x,y) ^ yi-\- y 2 {x -I- c) -I- ysix^ 2cx -I- c^) 

-!-•••+ yn{c"~^ -V {n — l)c"~^x H h x"~^) 

± a{c" nc"~^x -I- • • • -I- ncx"~^ x"), (20.4.20) 

where the coefficients multiplying each yt can be computed using the 
binomial formula. The terms can then be rearranged, yielding a poly- 
nomial in X of degree n. It is easy to see that after this is done the 
coefficient multiplying x"~^ is given by 

yn ± one. 

Then if we choose 

f-^n 

C = T — , 

an 

the x"~^ term does not appear. 

Hence, we have shown that given an n-parameter family of one- 
dimensional vector fields of the form 

f{x, y) = yi-^ y 2 X -V y^x^ -\ -I- ynx"~^ ± ax", (20.4.21) 
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with a translation of x given by 



X 1-^ 



it can be rewritten in the form 

f{x, fl) = fh+ fi 2 X + jliX^ H h ij,„-ix"~'^ ± ax”', 



(20.4.22) 



where the fii, i = — 1, can be computed in terms of the fii, 

i = if desired. 

Let us now consider some examples. In all of the following examples the 
origin will be the degenerate fixed point. 

Example 20.4.10 (One-Dimensional Vector Fields), 
a) Consider the vector field 



X = ax^ + 0{x^). 



(20.4.23) 



We follow the steps described above for constructing a versal deformation of 

(20.4.23). 

Step 1. Equation (20.4.23) is already in a sufficient normal form. 

Step 2. We truncate (20.4.23) to obtain 



a 7 ^ 0. 



(20.4.24) 



From Example 20.4.7, we know that a; = 0 is a degenerate fixed point of (20.4.24) 
having codimension 1. Now recall the definitions of the subsets F and B of 
J^(K^,R^) described in Section 20.4. For reference, we denote below these sub- 



manifolds of J^( 



and to their right a typical coordinate in the submanifold 

(20.4.25) 



l^) - (x,f{x) 



d£. . d^. . 



F (x,0, Jjx), gj^{x) 



B- (x,0,0,^{x) 



(20.4.26) 

(20.4.27) 



Now the 2-jet of (20.4.23) at a; = 0 is a typical point in B. A two-parameter 
family of vector fields transverse to B and F is given by 



■ Ml + f^ 2 X + ax . 



(20.4.28) 



However, this is a codimension one singularity. From the technical remark above, 
we know that through a shift of coordinates (20.4.28) can be written as 



X = p, + ax , 

and we take this as our candidate for a versal deformation. 



(20.4.29) 



Step 3. It follows from the Thom transversality theorem (Theorem 20.4.5) that 
(20.4.29) is a generic family. The reader should perform the simple calculation 
necessary to verify this fact. 
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Step 4- In Section 20.1c we showed that higher order terms do not affect the 
dynamics of (20.4.29) near — (0,0). Hence, the deformation is versal. 

We can conclude from this that the saddle-node bifurcation is generic in one- 
parameter families of vector fields. 

b) Consider the vector field 

x = ax^ + 0{x'^). (20.4.30) 



Step 1. Equation (20.4.30) is already in a sufficient normal form. 

Step 2. We truncate (20.4.30) to obtain 

X = ax^. (20.4.31) 



From Example 20.4.8, x = 0 is a codimension 2 fixed point. Eor reference, we 
denote below the subsets B' , B, and F of J®(R^,K^) discussed in Example 20.4.8 
with typical coordinates immediately to the right. 



F 

B 

B' 





(20.4.32) 




(20.4.33) 


d'^f d^f \ 

®-0,0,^(x),^(a;)j , 


(20.4.34) 


9® f \ 

*,0,0,0, ^(*)j. 


(20.4.35) 



Since this is a codimension two singularity we want to embed (20.4.31) in a 
two-parameter generic family transverse. Following the same reasoning given in 
the previous example, we obtain 



X 



= p,i + fi 2 X + ax 



3 



(20.4.36) 



as a candidate for a versal deformation. 

Step 3. It is an immediate consequence of the Thom transversality theorem (The- 
orem 20.4.5) that (20.4.36) is a generic family. The reader should perform the 
calculations necessary to verify this statement. 

Step 4- In Section 20. le, we proved that higher order terms do not qualitatively 
effect the local dynamics of (20.4.36). Hence, we have found a versal deformation 
of (20.4.30). 



End of Example 20.4.10 



Example 20.4.11 (Two-Dimensional Vector Fields), 
a) The Poineare-Andronov-Hopf Bifurcation 
Consider the vector field 

X = -ujy + 0{2), 
y = uix + 0(2). 



(20.4.37) 
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Step 1. From Example 19.2a, the normal form for this vector field in cartesian 
coordinates is 



X = —ujy + {ax — by){x^ + y^) + 0(5), 
y — oox + (bx + ay)(x^ + y^) + 0(5). (20.4.38) 

We note the important fact that due to the structure of the linear part of the 
vector field the normal form (20.4.38) contains no even-order terms. 

Step 2. We take as the truncated normal form 

X = -uiy + (ax - by){x^ -|- y^), 

y = Lox -\- {bx + ay){x'^ + y^). (20.4.39) 

From Example 20.5.5 in Section 20.5 we note that {x, y) = (0, 0) is a codimension 
2 fixed point. From Example 20.4.6 we recall the subsets B and F of J^(K^,K^) 
and denote typical coordinates on these subsets to the right. 

j2(r2^r2) _ (^^j^j;)^Df{x),D^f{x)) , (20.4.40) 

F- {x,0,Df{x),D^f{x)), (20.4.41) 

B - (x, 0, Df{x), D^f{x)) , (20.4.42) 

where Df{x) represents nonhyperbolic matrices. Now we want to embed (20.4.39) 
in a one-parameter family transverse to B, but due to the structure of the linear 
part of the vector field, we want the origin to remain a fixed point. From Section 
20.5, Example 20.5.5, the matrix 

(! 7 ) 

lies on a surface of codimension 2 in the four-dimensional space of 2 x 2 real 
matrices. Moreover, a versal deformation for (20.4.43) is given by 

( ^ -w-7\ (20.4.44) 

Vw+7 M y 

Using (20.4.44), we take as our transverse family 

X = yx — {u! + ■y)y + {ax — by){x^ + y^), 
y = {u! + y)x + jay + {bx -I- ay){x^ + y^). (20.4.45) 

This is a two-parameter family. But it is “well known” that the Poincare- 
Andronov-Hopf bifurcation is a codimension one bifurcation. However, as we 
mentioned earlier, one parameter can be removed by a re-parametrization of 
time and the coefficients on the nonlinear terms. This is done as follows. 

Let 
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where we view lo as 0{1) and fixed, and the deformation parameter 7 as small. 
In this way w + 7 is bounded away from zero. Under this rescaling of time and 
re-parametrization (20.4.45) becomes 

X = Xx — ojy -\ — — (ax — by)(x^ + y^), 

o> + 7 

y = ujx + Xy -{ — {bx + ay)(x^ + 

o> + 7 

and the factor — ^ can be absorbed into the constants a and b by a re-definition 
of the constants multiplying the nonlinear terms. 

This “elimination of a parameter by re-parametrization of time” can be viewed 
in another way. Let us work first in the complex setting. The normal form in 
complex coordinates is given by 

z = icoz + cz'^z + 0(5), 

where ui is real, c = a + ib. From Example 20.5.5 of Section 20.5, this is a complex 
codimension one fixed point, and we can take as a versal deformation 

z = {iuj + y + i^)z + cz^z + 0(5), 

where y and 7 are real. Re-writing this equation in real polar coordinates gives 

r = yr + ar^ + 0(5), 

6 — uj + ^ + br^ + 0(4). 

Hence, we see that for w -I- 7 bounded away from zero (which we are assuming 
to be true with u> 0 ( 1 ) and fixed, with 7 the small deformation parameter), the 
small parameter 7 has no qualitative effect on the dynamics. 

Step 3. It follows from the Thom transversality theorem (Theorem 20.4.5) that 
(20.4.45) is a generic family (when we take into account that the origin must 
remain a hxed point in the one-parameter family). The reader should perform 
the necessary calculations to verify this statement. 

Step 4- Theorem 20.2.3 implies that the higher order terms in the normal form do 
not qualitatively change the dynamics of (20.4.45). Hence, we have constructed 
a versal deformation of (20.4.37). 

Thus, we can conclude that, like saddle-node bifurcations, Poincare- Andronov- 
Hopf bifurcations are also generic in one-parameter families of vector fields. 

Due to the importance of the Poincare- Andronov-Hopf bifurcation, before end- 
ing our discussion let us re-examine it from a slightly different point of view. 

In polar coordinates, the normal form is given by (cf. (20.2.9)) 

r = ar^ + 0(r^), 

e = uJ + Oir^). (20.4.46) 

Our goal is to construct a versal deformation of (20.4.46). It is in this context 
that we see yet another example of the power and conceptual clarity that results 
upon transforming the system to normal form. 
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In our study of the dynamics of (20.4.46) we have seen (see Section 20.2, 
Lemma 20.2.1) that, for r sufficiently small, we need only to study 

f = ar® + 0(r®) = 0, (20.4.47) 

since 9{t) merely increases monotonically in t. 

Equation (20.4.47) looks very much like the degenerate one-dimensional vector 
field studied in Example 20.4.10 whose versal deformation yielded the pitchfork 
bifurcation. However, there is an important difference; namely, due to the struc- 
ture of the linear part of the vector field in (20.4.37), the r component of the 
vector field must have no even-order terms in r and must be zero at r = 0. 
Hence, this degenerate fixed point is codimension 1 rather than codimension 2 as 
in the pitchfork bifurcation, and a natural candidate for a versal deformation is 

r = + ar , 

e = uj. (20.4.48) 

It is the content of Theorem 20.2.3 that (20.4.48) is indeed a versal deformation. 

b) A Double-Zero Eigenvalue 
Consider the vector field 



x = y + Oi2), 

y = 0{2). (20.4.49) 

Step 1. From Example 19.1.2, the normal form for (20.4.49) is given by 

X = y-\-0{3), 

y = axy by^ 0{3). (20.4.50) 

Step 2. We take as the truncated normal form 

x = y, 

y = axy-\-by^. (20.4.51) 

From Example 20.4.9, recall that {x,y) = (0,0) is a codimension 2 fixed point. 
From Example 20.4.6 we denote the subsets B and F of J^(R^,K^) with typical 
coordinates of points in these sets to the right. 

j1(r2^r2) _ (20.4.52) 

F - {x,0,Df{x)), (20.4.53) 

H - (*,0,5/(x)) , (20.4.54) 

where Df{x) represent nonhyperbolic matrices. In Appendix 1 we show that 
these matrices form a three-dimensional surface in the four-dimensional space of 
2x2 matrices. We also show in this appendix that matrices with Jordan canonical 
form given by 



0 1 
0 0 



(20.4.55) 
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form a two-dimensional surface, B' , with B' (Z B (Z F (Z We now 

seek to embed (20.4.51) in a two-parameter family transverse to B' . In Appendix 
1 we show that a versal deformation of (20.4.55) is given by 

( ^ M . (20.4.56) 

\fJ-i f^2j 

Thus, one might take as a transverse family 
x = y, 

y = fiix + H 2 y + ax^ + bxy. (20.4.57) 

However, recall the remarks following the definition of codimension (Definition 
20.4.6). Generically, we expect the fixed points to move as the parameters are 
varied. This does not happen in (20.4.57); the origin always remains a fixed point. 
This situation is easy to remedy. 

Notice from the form of (20.4.57) that any fixed point must have j/ = 0. If we 
make the coordinate transformation 



X ^ X — Xo, 

y^y, 

and take as a versal deformation of the linear part 



(20.4.58) 



r 




{ X — Xo\ 


Vmi 




\ y ) 



(20.4.59) 



then a simple reparameterization allows us to transform (20.4.57) into 



x = y, 

y ^ + fi 2 y + axy + by^’ 



(20.4.60) 



see exercise 5 at the end of this section. We remark that in some cases it may be 
necessary for the origin to remain a fixed point as the parameters are varied; for 
example, in the case where the normal form is invariant under the transformation 
(x,y) {-x,-y). 

Step 3. It follows from the Thom transversality theorem that (20.4.60) is a generic 
family. The reader should perform the necessary calculation to verify this state- 
ment. 



Step 4- Bogdanov [1975] proved that (20.4.60) is a versal deformation. We will 
consider this question in great detail in Section 20.6. 



End of Example 20.4.11 



Practical Remarks on the Location of Parameters 

For a specific dynamical system arising in applications the number of pa- 
rameters and their locations in the equation are usually fixed. The theory 
developed in this section tells us that too many parameters (i.e., more than 
the codimension of the fixed point) are permissible provided they result in 
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a transverse family. Having more parameters than the codimension of the 
fixed point will simply require more work in enumerating all the cases. 
However, one must verify that the parameters are in the correct location 
so as to form a transverse family. We have seen that there is some freedom 
in where the parameters may be; in one dimension it is fairly obvious, in 
higher dimensions it is not as obvious, but the reader should keep in mind 
that transversality is the typical situation. 

20. 4e Exercises 

1. Let denote the set of symmetric j-linear maps of into R”'. Prove 

that it is a linear vector space. 

2. Let 

k 

P (R^,R^) = R”" X Liy^ (R’",R”") . 
j = i 

Prove that 

jk ^ R’" X P (R^,R”^) . 

Show that (R^,R^) is a linear vector space. 

3. Let U C R”' and V C R^ be open sets. Show that (t/, V) is an open subset of 

jk 

4. a) Consider the following three-parameter family of one-dimensional vector fields 

X — fi^x^ fi 2 X -\- /Ti, X G R^. (20.4.61) 

Show that by a parameter-dependent shift of x, (20.4.61) can be written as a 
two-parameter family 

X — x^ + "b • 

What are x, and Ji^ in terms of fc, /ti, /j- 2 , and ^ 13 ? 
b) Consider the following two-parameter family of one-dimensional vector fields 

X — x'^ fi 2 X -\- Pi, X G R^. (20.4.62) 

Show that by a parameter-dependent shift of x, (20.4.62) can be written as a 
one-parameter family 

-c _2 , — 

X = X + Pi . 

What are x and pi in terms of x, pi, and p 2 ? 

Discuss the results of a) and b) in terms of the codimension of a bifurcation and the 
number of parameters. 

In particular, consider part b) and address these issues in relation to a comparison of 
the saddle-node bifurcation 

2 , 

X — X -h p 

and the transcritical bifurcation 



X — X px. 
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5. Consider the following two-parameter family of planar vector fields 

X ^ y, 

y — fiix + fi 2 y ax bxy. 



(20.4.63) 



Under the shift of coordinates 



X x Xq, 

y ^ y, 



(20.4.63) becomes 



X ^ y, 

y — y^ixo + y^ix y, 2 y + a{x 2xxq + Xq) b{xo + x)y. 



(20.4.64) 



Show that by a parameter-dependent shift of y (but not x), (20.4.63) can be trans- 
formed to the form 



X = y, 

y ^ Ml + M2y + + ^^y- 

(20.4.65) 



What are and /I 2 in terms of xq, (i 2 , n, and b? 

6. Consider the “Hopf-steady state interaction”, i.e., a three dimensional vector field 
having a fixed point where the (real) Jordan canonical form of the matrix is given by 

/ 0 -LJ 0\ 

I cj 0 0 I , oj > 0. 

\ 0 0 0 / 



Show that this is a codimension two bifurcation and compute a candidate for a versal 
deformation. {Hint: Use Example 20.5.6 from Section 20.5.) 



7. 



Consider the “double-Hopf bifurcation” or the “mode interaction”, i.e., a four dimen- 
sional vector field having a fixed point where the (real) Jordan canonical form of the 
matrix is given by 



( 0 —LJi 0 0 \ 

u;i 0 0 0 I 

0 0 0 —lj2 I 

0 0 CJ2 0 / 



(^1, ^2 > 0. 



(a) For the nonresonant case, compute the codimension and a candidate for a versal 
deformation. 

(b) For the resonant case (except for 1 : 1 resonance), compute the codimension and 
a candidate for a versal deformation. 

(c) For the 1 : 1 resonant case (both semisimple and non-semisimple), compute the 
codimension and a candidate for a versal deformation. 



{Hint: Use Examples 20.5.7 and 20.5.8 from Section 20.5.) 

8. Survey the literature and find five applications where the “Hopf- steady state” bifur- 
cation arises. 

9. Survey the literature and find five applications where the a) non- resonant double-Hopf 
bifurcation arises, b) where the resonant double- Hopf bifurcation arises. In particular, 
find an example of a 1 : 1 resonant double-Hopf bifurcation in both the semsimple and 
non- semisimple case. 
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20.5 Versal Deformations of Families of Matrices 



In this section we develop the theory of versal deformations of matrices 
which we use in computing versal deformations of fixed points of dynamical 
systems. Our discussion follows Arnold [1983]. Let M be the space of n x 
n matrices with complex entries. The relation of similarity of matrices 
partitions the entire space into manifolds consisting of matrices having 
the same eigenvalues and dimensions of Jordan blocks; this partitioning is 
continuous, since the eigenvalues vary continuously. 

Suppose we have a matrix having some identical eigenvalues, and we want 
to reduce it to Jordan canonical form. This process is not stable, because 
the slightest perturbation might destroy the Jordan canonical form com- 
pletely. Thus, if the matrix is only approximately known (or the reduction 
is attempted by computer), then the procedure may yield nonsense. 

We give an example. Consider the matrix 



A(A) = 



0 A 
0 0 



The Jordan canonical form is given by 

0 1 
0 0 



A yf 0, 



with conjugating matrix 

C{\) = 



1 0 
0 1/AF 



A yf 0. 



Now, at A = 0, the Jordan canonical form of A(A) is given by 



0 0 
0 0 



with conjugating matrix 



C(0) = 



1 0 
0 1 



Therefore, C(A) is discontinuous at A = 0. 

However, even though multiple eigenvalues is an unstable situation for 
individual matrices, it is stable for parametrized families of matrices, i.e., 
perturbing the family does not remove the multiple eigenvalue matrix from 
the family. Thus, while we can reduce every member of the family to Jordan 
canonical form (as in the example above), in general, the transformation 
will depend discontinuously on the parameter. The problem we address is 
the following: 
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What is the simplest form to which a family of matrices can he 
reduced depending differentiably on the parameters by a change 
of parameters depending differentiably on the parameters? 

In the following we will construct such families and determine the mini- 
mum number of parameters, but first we must begin with some definitions 
and develop some necessary machinery. 

We will consider nx n matrices whose entries are complex numbers. Let 
Aq be such a matrix. The reason that we will be dealing (initially) with 
complex matrices is that the Jordan canonical form is more simple in this 
setting. Afterwards, we will show how the results obtained in the complex 
setting can be used for matrices with real entries. 

We first need to introduce several definitions. 

Definition 20.5.1 (Deformation of a Matrix) A deformation of Aq is 
o C’’ (r > 1) mapping 



A: 

A(A), 

where A G is some parameter space and 

^(Ao) = Aq. 

A deformation is also called a family, the variables Xi, i = 1, •••,£, are 
called the parameters, and A is called the base of the family. 

Definition 20.5.2 (Equivalence of Deformations) Two deformations 
A(A), B{X) of Aq are called equivalent if there exists a deformation of the 
identity matrix C(A) (T7(Ao) = id) with the same base such that 

B{X) = C'(A)A(A)C'-i(A). 

The following idea will be useful for reparametrizing families of matrices 
in order to reduce the number of parameters. 

Definition 20.5.3 (Family Induced from a Deformation) Let S C 

C™, A C be open sets. Consider the C’’ (r >1) mapping 

(/) : S — >■ A, 

(fin), 



with 4 >{pq) = Aq. 

The family induced from A by the mapping (j) is called {<)>* A){p) and is 
defined by 

{4>*A){p) = A{4>{p)), 
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Definition 20.5.4 (Versal, Miniversal, and Universal Deforma- 
tions) A deformation ^(A) of a matrix Aq is said to be versal if any 
deformation B{^) of Aq is equivalent to a deformation induced from A, 
i.e., 

for some change of parameters 



: S ^ A, 



with C{p,Q) = id and = Aq. 

A versal deformation is said to be universal if the inducing mapping (i.e., 
change of parameters map) is determined uniquely by the deformation B. 

A versal deformation is said to be miniversal if the dimension of the 
parameter space is the smallest possible for a versal deformation. 

At this stage it is useful to consider an example. 

Example 20.5.1. Consider the matrix 

. fo l 

0 

It should be clear that a versal deformation of Aq is given by 
B{g) = 



0 1 
0 0 



+ 



p.2 

P-3 P‘4, 



where g = {pi, p 2 , ps, P 4 ) G C^. However, B(/r) is not miniversal; a miniversal 
deformation is given by 



A(A) = 



0 1 
0 0 



0 0 
Ai A 2 



where A = (Ai, A 2 ) G C^. This can be seen by showing that B{p) is equivalent to 
a deformation induced from A(A). If we let 



C{p) = 
then it follows that 



1 + P2 0 
-pi 1 



C-\p) = 



1 



0 



1 + \ Ml 1 + M2 



A{\) = A{f{p)) = C-\g)B{p)C{p) 



0 1 
0 0 



+ 



0 0 
^3(1 + M2) — M 1 +M 4 



where we take 

4>{p) = { 4 >l{p), 4 > 2 {p)) = (1*3(1 + M 2 ) -^iM 4 ,M 1 +M 4 ) = (Ai,A 2 ) = A 
as the inducing mapping. 



End of Example 20.5.1 
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Now that we have the necessary definitions out of the way we can proceed 
toward our goal, which is to construct normal forms (miniversal deforma- 
tions) of matrices having multiple eigenvalues. It is important to know the 
number of parameters necessary and to know the conditions that the nor- 
mal form must satisfy for versality. To reach that point we must develop 
some machinery in order that the result does not appear to be “pulled out 
of the air.” 

We denote the set of all n x n matrices with complex entries by M. M 
is isomorphic to C" ; however, we will simply write M = C” . 

Now let us consider the Lie group G = GL{n, C) of all nonsingular nx n 
matrices with complex entries. GL{n,C) is a submanifold of C” . 

Definition 20.5.5 (The Adjoint Action) The group G acts on M ac- 
cording to the formula 

Adgiri = gmg~^ , {m G M, g G G) (20.5.1) 

(Ad stands for adjoint). 

Definition 20.5.6 (Orbit Under the Adjoint Action) Consider the 
orbit of an arbitrary fixed matrix Aq G M under the action of G; this is the 
set of points m G M such that m = gAog~^ for all g G G. The orbit of Aq 
under G forms a smooth submanifold of M, which we denote by N. Thus, 
from ( 20. 5. 1 ), the orbit, N , of Aq consists of all matrices similar to Aq . 

We next restate the notion of transversality of a map. (cf. Definition 
20.4.4). 

Definition 20.5.7 (Transversality of a Map) Let N C M be a smooth 
submanifold of a manifold M. Consider a C” (r >1) mapping of another 
manifold A into M and let X be a point in A such that A(A) G N . Then 
the mapping A is called transversal to A at A if the tangent space to M at 
A(A) is the sum 



TMa(x) = TNa(x) + DA{\) ■ TAx, (20.5.2) 

where DA{X) denotes the derivative of A at A; see Figure 20.5.1. 

With these two notions we can state and prove the proposition that pro- 
vides the key for constructing miniversal deformations of Jordan matrices. 

Proposition 20.5.8 If the mapping A is transversal to the orbit of Aq 
at X — Ao, then A(A) is a versal deformation. If the dimension of the 
parameter space is equal to the codimension of the orbit of Aq, then the 
deformation is miniversal. 
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A A 

• ^ 

X 



FIGURE 20.5.1. 

Proof: Unfortunately we cannot proceed to the proof directly but must go 
in a rather roundabout way through several steps and definition. First note, 
however, that a geometrical picture of Proposition 20.5.8 is given in Figure 
20.5.2. 

In Figure 20.5.2 N is codimension 2; hence, we choose the dimension of 
A to be 2. Since A(A) is transverse to at A = Aq, we thus represent it as 
a two-dimensional surface passing through Aq. We want to show that A(A) 
satisfying this geometrical picture is actually a miniversal deformation of 
Aq . To do this we will need to develop a local coordinate structure near Aq 
which describes points along the orbit of Aq and points off of the orbit of 
Aq. We begin with a definition. 





FIGURE 20.5.2. 



Definition 20.5.9 (Centralizer of a Matrix) The centralizer of a ma- 
trix u is the set of all matrices commuting with u denoted 

Zu = {v '■ [m, I'] = 0} ) [w, v] = uv — vu. 



(20.5.3) 
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It is easy to show that the centralizer of any matrix of order n is a linear 
subspace of M = C" . We leave this as an exercise for the reader. 

Now we want to develop the geometrical structure of M near A(Aq) = Ag. 
Let Z be the centralizer of the matrix Aq. Consider the set of nonsingu- 
lar matrices that contain the identity matrix (which we denote by “id”). 
Clearly this set has dimension v? . Within this set consider a smooth sub- 
manifold, P, intersecting the subspace id -I- 2^ transversely at id and having 
dimension equal to the codimension of the centralizer; see Figure 20.5.3. 
With Figure 20.5.3 in mind, consider the mapping 

$ : P X A C”", 

$ : (p. A) ^ pA(A)p"^ = $(p. A) (20.5.4) 

(we will worry about dimensions shortly). 

The following lemma will provide us with local coordinates near Aq. 





FIGURE 20.5.3. 



Lemma 20.5.10 In a neighborhood o/(id, Aq) 4) zs o local diffeomorphism. 



Proof: Before proving lemma this we need to state several facts. 

1) Consider the mapping 

z/> : G^C”', 

b — >■ bA()b~^ = tp{b). 

2 

The derivative of if at the identity is a linear mapping of TjdG onto TaqUA ■ 
Without loss of generality, we can take TidG = C”^ and = C”^. 

Denoting the derivative of ip at id by D^l>(id), we want to show that, for 
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u € C" , Dtp(id)u is given by the operation of commutation of u with Aq, 
i.e., Dtp{id)u = [t6, Aq]. 

This is easily calculated as follows 



D^p(id)u = lim 
£— >-0 



= lim 
£ — >^0 

= lim 
£ — ^0 



(id + £M)Ao(id + eu) ^ — Aq 
£ 

(Aq T £'Uvlo)(ld — £u) + Cl(£^) — Aq 



Aq — cAqU SuAq -\~ — Aq 



= uAq - AqU = [u,Aq]-, 



therefore, 

DV'(id) : C"" C”", 

u -)> [u,Ao]. 



(20.5.5) 



We make the following observation. Since dimG = dimM = n^, the di- 
mension of the centralizer is equal to the codimension of the orbit of Aq. 
This is because, roughly speaking, from (20.5.1) and (20.5.4) we can think 
of the centralizer of Aq to be the matrices that do not change Aq. Thus, 
we have 



dim 2^ = dim A, (20.5.6) 

dim P = dim N, (20.5.7) 



and 



dim A -I- dim N = . 



Now, returning to 4>, 



$ : P X A C”". 



From (20.5.6) and (20.5.7) we see that the dimensions are consistent (i.e., 
dim(P X A) = dimC" ) for 4) to be a diffeomorphism. 

We can now finally prove Lemma 20.5.10. We compute the derivative of 
4> at (id, Ao), denoted P4)(id, Aq), and examine how it acts on a typical 
element of T(id,Ao)(-P ^ ^)- Let (w. A) G T(id,Ao)(L’ ^ then we have 



Dd>{id, Xq){u, A) = {Dpd>{id, Ao), PA^(id, Aq))(m, A). (20.5.8) 



Using (20.5.4) and (20.5.5) it is easy to see that (20.5.8) is given by 



Dd>{id, Xq){u, A) = ([u, Ao], PA(Ao)A). (20.5.9) 

By construction of the submanifold P, iAp4)(id, Ao) maps T^^P isomorphi- 
cally to a space tangent to the orbit N at Ao (check dimensions and the 
fact that [u, Ao] yf 0). Also by the hypothesis of Proposition 20.5.8, DA{Xq) 
maps TxgA isomorphic ally to a space transverse to N at A(Aq) = Ao. Con- 
sequently, P4>(id, Ao) is an isomorphism between linear spaces of dimension 
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n^; thus, by the inverse function theorem, <I> is a local diffeomorphism. This 
completes our proof of Lemma 20.5.10. □ 

This lemma tells us that we have a local product structure (in terms of 
coordinates) near Aq in M (note: this is for a sufficiently small neighbor- 
hood of (id, Ao), since the inverse function theorem is only a local result); 
see Figure 20.5.4. 

Now we can finish the proof of Proposition 20.5.8. 

Let B(/i) for some fixed /i G S C C™ be an arbitrary deformation of Aq 
(i.e., for some fio & T, C C™, = Ag). In the local coordinates near 

Aq, we know that any matrix sufficiently close to Aq can be represented as 

<P{p,X)=pA{X)p-\ peP, AgAcC^ 
fdence, for p — po sufficiently small, B{p) has the representation 
B{p) = 4>(p, A) = pA{X)p~^ 
for some p G P, X G A C C^. 




FIGURE 20.5.4. 

Now let 7Ti and 7T2 be the projections onto P and A of P x A, respectively. 
Then it follows by definition of <f) that 

A = tt2^~^{B{p)), 

p = 7Ti^-^{B{p)). 

Thus, letting 

(j){p) = TT2 ^~^{B{p)), 

C{p) = ^,<^-\B{p)), 

it follows that 

B{p) = C{p)A{cj>{p))C-\p). 
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This proves Proposition 20.5.8. □ 

We remark that it should be clear from the argument that the deforma- 
tion is miniversal, i.e., we have used the smallest number of parameters. 

Thus, the proposition tells us that, in order to construct a miniversal 
deformation of Aq, we may take the family of matrices 

^0 + B, 

where B is in the orthogonal complement of the orbit of Aq, and the entries 
of B are the deformation parameters. An obvious question therefore is how 
do you compute B1 

Lemma 20.5.11 A vector B in the tangent space o/C” at the point Aq 
is perpendicular to the orbit of Aq if and only if 

[B*,Aq] = 0 

where B* denotes the adjoint of B. 

Proof: Vectors tangent to the orbit are matrices representable in the form 

[x, Aq] , X G M. 

Orthogonality of B to the orbit of Aq means that, for any x € M, 

([x,Ao],B) = 0, (20.5.10) 

where ( , ) is the inner product on the space of matrices, which we take as 

(A,B) =tr(AB*). (20.5.11) 

Using (20.5.10), (20.5.11) becomes 

0 = tr([a;, Ao]S*) 

= tr (xAqB* — AqxB*) . 

Using the fact that 

tr(AB) = tr(BA), 
tr(A + B) = trA -|- trB, 

we obtain 



tr(xAQB* — AqxB*) = tr(xAQB*) — tr(AQxB*) 
= tr(Aoi?*x) — tr(xB*Ao) 
= tr(AQB*x) — tr(B*AQx) 
= tr((AQB* - B*Aq)x) 

= tr([Ao,B*]x) 

= ([Ao,B*],a:*) = 0. 
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Since x was arbitrary, this implies 



[Ao,B*] = Q. □ 

This lemma actually allows us to “read off” the form of B if is in 
Jordan canonical form. 

Suppose Aq has been transformed to Jordan canonical form and has 
distinct eigenvalues 

tti 1 i 1 ‘ * j s, 

and to each eigenvalue there corresponds a finite number of Jordan blocks 
of order rii 

Tii(ai) > n2(a*) > • • • . 

For the moment, to simplify our arguments, we will assume that our 
matrix has only one distinct eigenvalue, a, and let us say three Jordan 
blocks ni(o;) > 712(0) > 77,3(0). The matrices which commute with Aq then 
have the structure shown in Figure 20.5.5, where each oblique segment in 
each separate Jordan block denotes a sequence of equal entries. 




FIGURE 20.5.5. 

Thus, a matrix B* in the orthogonal complement of Aq has the structure 
shown in Figure 20.5.6. The general proofs of these statements can be 
found in Gantmacher [1977], [1989] for the case of an arbitrary number of 
eigenvalues and Jordan blocks; however, we will not need such generality, 
since we will be concerned with 2 x 2, 3 x 3, and 4x4 matrices only and, 
in these cases, it is relatively easy to verify the structures shown in Figures 
20.5.5 and 20.5.6 by direct calculation. We will do this shortly. 

Therefore, a matrix of the structure shown in Figure 20.5.6 is orthogonal 
to Aq. Now in general we only desire transversality, not orthogonality, and 
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FIGURE 20.5.6. 

it may simplify matters (i.e., reduce the number of matrix elements) to 
choose a basis not orthogonal but transverse to Aq so that B* appears as 
simply as possible. This can be accomplished by taking a matrix of the 
form shown in Figure 20.5.6 as B* and replacing every slanted line by one 
independent parameter and the rest of the entries on the slanted line by 
zeros. 



FIGURE 20.5.7. 

The nonzero entry (independent parameter) can be placed at any po- 
sition along the slanted line. Thus, matrices transverse to Aq would have 
the structure shown in Figure 20.5.7, where the horizontal and vertical 
lines in the figure represent the positions where the required number of 
independent parameters are placed. From the above form of the matri- 
ces commuting with Aq, and only one distinct eigenvalue, the number of 
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parameters needed for a miniversal deformation is given by the formula 

ni(a) + 3n2(a) + bnz{ct) + • • • . (20.5.12) 

We now will state the general case for an arbitrary number of eigenvalues 
ai, i = 1, • • • , s, and then work out some explicit examples. 

Theorem 20.5.12 The smallest number of parameters of a versal defor- 
mation of the matrix Aq is equal to 

S 

d = ^ [ni{ai) + 3n2{ai) + 5n3(a*) H ] . (20.5.13) 

Proof: See Arnold [1983] and Gantmacher [1977], [1989]. □ 

We now sum up everything in our main theorem. 

Theorem 20.5.13 Every matrix Aq has a miniversal deformation; the 
number of its parameters is equal to the codimension of the orbit of Aq 
or, equivalently, to the dimension of the centralizer of Aq. 

If Aq is in Jordan normal form, then for a miniversal deformation we 
may take a d-parameter normal form (with d given in Theorem A.l.^) 
Aq + B, where the blocks of B have the previously described form. 

In other words, any complex matrix close to a given matrix can be reduced 
to the above d-parameter normal form Aq -\- B (where Aq is the Jordan 
canonical form of the given matrix), so that the reducing mapping and the 
parameters of the normal form depend in a C’’ manner on the elements of 
the original matrix. 

Now we will compute some examples. 

Example 20.5.2. Consider the matrix 

Ao=(q a\ (20.5.14) 

This matrix is denoted where a refers to the eigenvalue and two refers to the 
size of the Jordan block. In the context of bifurcations of fixed points of vector 
fields, we take a = 0 and, for fixed points of maps, we take a = 1. From (20.5.12) 
it follows that a versal deformation of (20.5.14) has at least two parameters; 
hence, matrices having Jordan canonical form Aq (i.e., from (20.5.1), the orbit of 
Aq) form a codimension 2 submanifold of C^. 

We now want to compute a versal deformation for Aq. First we compute a 
matrix which commutes with Aq. 

fa b\ f a l\ fa 1 \ f a b\ 

yc dy^O a J yO a J \c d ) 

_ f aa a-\-ba\ f aa -h c ba + d\ 

\ca c -I- da J ac ad J 

_ f —c a — d\ _ f 0 0\ 

"VO c )-[o oj’ 
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thus, 

c = 0, a — d, and b = arbitrary. 

Therefore, we obtain 

From Lemma 20.5.11, we have that a matrix orthogonal to Aq is given by 



B = 



a 0 
b a I ’ 



where a, 6 are arbitrary complex numbers. 

A family of matrices that is transverse to Aq and that simplihes our expression 
for a versal deformation would be 



B = 



0 0 

b a 



We can check whether or not B is transverse to Ao by showing that 
{B,B} = tr{BB*) / 0. 

In our case we have 



BB* = 



0 0\ /a 0 

b a ) \ b a 



0 



0 



ba + ab laf 



so that 



{B,B} = tr{BB*) = \a\^ / 0. 



Relabeling by letting b — and a = A 2 , we obtain the following versal defor- 
mation 



a 1 

0 a. 



0 0 

Ai A 2 j 



End of Example 20.5.2 



Example 20.5.3. Consider the matrix 

Ao=(“ aa. (20.5.15) 

From (20.5.12), it follows that a versal deformation of (20.5.15) has four pa- 
rameters; hence, matrices having Jordan normal form Aq form a codimension 4 
submanifold of C^. 

We next compute a family orthogonal to the orbit of Aq as follows 
fa b\ f a 0\ fa 0\ f a b\ _ f 0 0\ 

l^c dy\0 a J 1^0 ayl^c d J \^0 0 J ' 

f aa ba\ f aa ob\ _ f 0 0 A 

Y ca da J ac ad J 0 0 J ' 



or 
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Thus, a, b, c, d can be anything; this situation is therefore codimension 4 with 
a versal deformation given by 

a 0 \ / Ai A 2 

0 a J \ As A 4 

where we have relabeled the parameters as in Example 20.5.2. 

End of Example 20.5.3 



Example 20.5.4. Consider the matrix 



Ao = 




(20.5.16) 



From (20.5.12), it follows that a versal deformation of (20.5.16) has at least five 
parameters; hence, matrices having Jordan canonical form Aq form a codimension 
5 submanifold of C®. 

We compute a family of matrices orthogonal to the orbit of Aq as follows 




\ a 


b 




d 


e 


/ 


1 \9 


h 


ij 



aa 


a + ba 


ca \ 




/ aa + d 


ab + e 


ac + f 


da 


d + ea 


fa 


- 


I ad 


ae 


af 


ga 


g + ha 


ia J 




V ag 


ah 


ai 



-d 


a — e 


-f\ 


/O 


0 


0\ 


0 


d 


0 


= 0 


0 


0 


0 


9 


0 / 


\0 


0 


0/ 



we thus obtain d = 0, g — 0, f = 0, a = e, and b, c,h,i = arbitrary. 

Therefore, we obtain 

(a 0 0 \ 

B= ibah\ 

\c 0 i J 

or a simpler form, transverse to the orbit of Aq, given by 

/ 0 0 0 \ 

_B = [ Ai A2 As j , 

y A4 0 As / 

where we have relabeled the parameters as in Example 20.5.2. Hence, a versal 
deformation of Ao is given by 



a 1 0 



0 0 0 



0 d 0 I + I Ai A 2 As 



0 0 a , 



A4 0 As 
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We leave it to the reader to verify the transversality of B to the orbit of Aq. 

End of Example 20.5.4 



To summarize, the first few low codimension matrices have versal defor- 
mations given by 



a 



a 1 

0 a 



0 0 

Ai A2 





/ a 


OA , / 


aa 


\ 0 




' ' 


a J V 



Ai A2 
A3 A4 





/ ^ 


1 


0\ 


1 0 


0 


0 \ 


a^a 


r 


OL 


old- 


Ai 


A 2 


A 3 




\0 


0 


a J 


U 4 


0 


As / 



These are the simplest forms to which these parametrized families of ma- 
trices containing multiple eigenvalues can be reduced by a transformation 
depending differentiably on the parameters. 



20.5a Versal Deformations of Real Matrices 

Before leaving this section there is a very important point we should ad- 
dress, namely, that all of our work in this section has dealt with complex 
numbers. The reason for this is simple; it is much easier to deal with the 
Jordan canonical form when dealing with matrices of complex numbers. 
However, throughout this book we are mainly interested in real-valued 
vector fields. Thus it is fortunate that the results for versal deformations of 
matrices of complex numbers go over immediately to the situation of versal 
deformations of matrices of real numbers. The main idea is the following 
(Galin [1972], Arnold [1983]). 

The decomplexification of a versal deformation with the minimum 
number of parameters of a complex matrix, Aq, can be chosen to be 
a versal deformation with the minimum number of parameters of the 
real matrix Ao, where Aq is the decomplexification of Aq. 

This statement should be almost obvious after reviewing some definitions 
and terminology. We will discuss only what is necessary for our purposes 
and refer the reader to Arnold [1973], Hirsch and Smale [1974], or Chapter 
3 for more information. 

It should be clear that the decomplexification of C"; is Moreover, 
if ei,---,e„ is a basis of C", then ei,---,e„, iei,...,ie„ is a basis for 
the decomplexification of C", K^". Now, let A = Ar + iAi be a matrix 
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representation of some complex linear operator mapping C” into itself. 
Then the decomplexification of this matrix is given by the 2n x 2n matrix 

(20.5.17) 

Now we show how we can use these ideas to construct a versal deforma- 
tion of a real matrix. As in the complex case, the construction of a versal 
deformation of a matrix in Jordan canonical form is carried out “Jordan 
block by Jordan block”. Therfore we really only need to understand the 
case of a real matrix having exactly two complex eigenvalues (complex 
conjugate pairs). 

Consider a real matrix 



f Ar —Ai \ 
\ Ai Ar j 



Ao : ^ 

and suppose that it has exactly two eigenvalues 

01 i i(3, /? yf 0, 

and each eigenvalue has corresponding Jordan blocks of dimension 
ni > ri2 > ri3 > ■ ■ ■ , with rii + U2 + ns + ■ ■ ■ = n. 



Using the theory of the real Jordan canonical form, we can construct a 
real basis in with respect to which Aq has the form of the matrix of 
the decomplexification of the Jordan canonical form of the complex matrix 

io : C” ^ C”, 



where Aq has only one eigenvalue, a + if3, with Jordan blocks of dimensions 
ni > U 2 > ns > ■ ■ ■ , with rii + U 2 + ns + ■■■ = n, i.e.. 



Aq = 



( J -/Jid\ 
\Pid J )' 



where id denotes the n x n identity matrix and J is the upper triangular 
real Jordan matrix with eigenvalue a and blocks of dimension ni > ri 2 > 
ns > ■ ■ ■ , with ni + U 2 + ns + ■ ■ ■ = n. Then a versal deformation for Aq is 
constructed from a versal deformation of Aq as follows. 



With Ao in Jordan canonical form, construct the versal deformation 
according the the methods for complex matrices developed above. 

Then decomplexify the result according to (20.5.17). 

As we have noted, the construction of a versal deformation of a matrix 
in Jordan canonical form is carried out “Jordan block by Jordan block”. 
Therefore if the eigenvalues of a matrix are all real, we see that there is 
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no difference with the theory developed for the case of complex matrices. 
If there are complex eigenvalues, versal deformations for the corresponding 
Jordan blocks are constructed as described above. 

Moreover, we also see immediately that the minimum number of param- 
eters of a real versal deformation are also given by the formula: 

^ = X! + 3n2(A) + 5n3(A) • • •) , 

A 



where the summation is over all s eigenvalues, both real and complex. 

We next consider some examples. 

Example 20.5.5 (An Imaginary Pair of Eigenvalues). 

Consider a 2 x 2 real matrix having eigenvalues p + ir. Then with respect to 
an appropriate basis it has the form 



Ao = 



p -T 
T p 



We want to construct a versal deformation of this matrix. 

The complexification of Aq is given by 

Ao = (a) , a = p + ir. 

According to Theorem 20.5.12, a versal deformation for Ao can be constructed 
with one (complex) parameter, A = ^ -I- *7, i.e, the versal deformation of Ao is 
given by 

(a) + (A) . 

With this result, and (20.5.17), a versal deformation of Ao is given by 



p -T 
T p 



+ 



p -7 

7 h 



End of Example 20.5.5 

Examples 20.5.6, and 20.5.7 use the fact that versal deformations are 
constructed Jordan block-by-block, versal deformations of Jordan blocks 
corresponding to real eigenvalues are exactly the same as in the complex 
case (with real parameters), and example 20.5.5. 

Example 20.5.6 (One Real, and A Pure Imaginary Pair of Eigenvalues). 

Consider a 3 x 3 real matrix having eigenvalues p ± ir, (3, t ^ 0. Then with 
respect to an appropriate basis it has the form 

( p-T 0 
Ao = I r p 0 
\0 0 (3 
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A versal deformation requires three real parameters and is given by 

/p-r0\ /p- 70 \ 

rp 0 + 7 p 0 . 

\0 0/3/ \0 0 5/ 



End of Example 20.5.6 

Example 20.5.7 (A Pair of Distinct, Imaginary Pairs of Eigenvalues). 

Consider a 4 x 4 real matrix having eigenvalues at = pi + in, i = 1,2, with 
Qi 7 ^ 0 - 2 . Then with respect to an appropriate basis it has the form 

/ pi -n 0 0 \ 

. _ ri pi 0 0 

° ^ 0 0 P2 -T2 ■ 

\ 0 0 T2 p2 / 

This matrix is simple two Jordan blocks of example 20.5.5. Hence, using the result 
from example 20.5.5, a versal deformation of Aq requires four real parameters and 
is given by 



/ pi 


— n 


0 






/ pi 


-71 


0 


0 \ 


n 


Pi 


0 


0 


+ 


7 i 


Pi 


0 


0 


0 


0 


P 2 


—T2 


0 


0 


P2 


—72 


\o 


0 


T 2 


p 2 / 




^0 


0 


72 


P2- J 



End of Example 20.5.7 

Example 20.5.8 (A Repeated Complex Pair of Eigenvalues: The Semisimple 
and Non-Semisimple Cases). 

(s ly 

(o °)+(a: i)- 

Letting 

a — p + IT, 

Xi = pi * 7 * 1 

where p,r,pi, and 71 are real, (20.5.17) implies that the decomplexification of 
these matrices is given by 



( ^ 


1 


— T 


0 \ 




/ 0 


0 


0 


0 \ 


0 


p 


0 


— r 


+ 


pi 


P2 


-71 


—72 


T 


0 


p 


1 


0 


0 


0 


0 


\0 


T 


0 


P / 




\7i 


72 


pi 


P2 / 



(20.5.20) 
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/ p 


0 


— T 


0 \ 




/ Ml 


P-2 


-71 


-72 \ 


0 


p 


0 


— T 


+ 


M3 


Pi 


-73 


-74 


T 


0 


p 


0 


71 


72 


Ml 


M2 


\0 


T 


0 


p / 




\73 


74 


M3 


Pi / 



(20.5.21) 



End of Example 20.5.8 



20.5b Exercises 

1. Let M denote the set of all n X n matrices with complex entries. Show that M can be 

2 

identified as C”" (note: see Dubrovin, Fomenko, and Novikov [1984] for an excellent 
discussion of matrix groups as surfaces). 

2. Show that GL(n,C) is a submanifold of C”" . 

3. Show that the orbit of a matrix Aq G M under the action of GL(n,C) defined by 

gAog~^, geGL(n,C), 

is a submanifold of M. 

4. Show that the centralizer of any matrix of order n (with complex entries) is a linear 

2 

subspace of C”" . 

5. Prove that the dimension of the centralizer is equal to the codimension of the orbit of 
Aq. 

6. Explain why that, in the local coordinates near Aq constructed in Lemma 20.5.10, any 
matrix sufficiently close to Aq can be represented in the form 

pA(A)p“\ p e f , agAcc*^. 

Can you give a more intuitive explanation of this based on elementary notions from 
linear algebra? 

7. Explain why the deformation constructed in Proposition 20.5.8 is miniversal. 

8. Prove the following statement: 

The decomplexification of a versal deformation with the min- 
imum number of parameters of a complex matrix, Aq, can be 
chosen to be a versal deformation with the minimum number of 
parameters of the real matrix where is the decomplexi- 
fication of ^ 0 - 

9. Prove that the decomplexification of is and that if ei, • • • , Cn is a basis of C”", 
then ei , • • • Cn ^ fei , • • • , icn is a basis for the decomplexification of , R^”^ . 

10. Suppose A — Ar iAi is the matrix representation of some linear mapping of into 
C^. Then show that 




is the decomplexification of this matrix. 

11. Compute miniversal deformations of the following real matrices. 

/O -cj 0\ 
a) j a; 0 0 j 

Vo 0 0/ 
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h) 



0 1 
0 0 
0 0 

0 0 
0 0 
0 0 

0 



— CJ 
0 
0 



0 0 
1 0 



0 —CO 

CO 1 



12. Relate Theorem 19.5.7 from Chapter 19 (especially equations (19.5.44) and (19.5.45)) 
to the miniversal deformation of a matrix and, more generally, the construction of 
versal deformations developed in this chapter. 



20.6 The Double-Zero Eigenvalue: the 
Takens- Bogdanov Bifurcation 



Suppose we have a vector field on M” having a fixed point at which the 
matrix associated with the linearization of the vector field about the fixed 
point has two zero eigenvalues bounded away from the imaginary axis. In 
this case we know that the study of the dynamics near this nonhyperbolic 
fixed point can be reduced to the study of the dynamics of the vector field 
restricted to the associated two-dimensional center manifold (cf. Chapter 
18). 



We assume that the reduction to the two-dimensional center manifold 
has been made, and the Jordan canonical form of the linear part of the 
vector field is given by 



0 1 
0 0 



( 20 . 6 . 1 ) 



Our goal is to study the dynamics near a nonhyperbolic fixed point hav- 
ing linear part given by (20.6.1). The procedure is fairly systematic and 
will be accomplished in the following steps. 



1. Compute a normal form and truncate. 
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2. Rescale the normal form so as to reduce the number of cases to be 
studied. 

3. Embed the truncated normal form in an appropriate two-parameter 
family (see Example 20.4.11b) 

4. Study the local dynamics of the two-parameter family of vector fields. 

4a. Find the fixed points and study the nature of their stability. 

4b. Study the bifurcations associated with the fixed points. 

4c. Based on a consideration of the local dynamics, infer if global 
bifurcations must be present. 

5. Analyze the global bifurcations. 

6. Study the effect of the neglected higher order terms in the normal 
form on the dynamics of the truncated normal form. 

We remark that Step 4c is a new phenomenon. However, we will see that 
it is not uncommon for global effects to be associated with local codimension 
k {k > 2) bifurcations. Moreover, we will see that it is often possible to 
“guess” their existence from a thorough local analysis. We will discuss this 
in more detail later on. Now we begin our analysis with Step 1. 

Step 1: The Normal Form. In Example 19.1.2 we saw that a normal form 
associated with a fixed point of a vector field having linear part (20.6.1) is 
given by 

x = y + 0{\x\^,\y\^), 

y = ax'^ + bxy + 0{\x\^ ,\y\^), (x,y) (20.6.2) 

At this stage we will neglect the 0(3) terms in (20.6.2) and study the 
resulting truncated normal form 

x = y, 

y = ax^ + bxy. (20.6.3) 

Step 2: Rescaling. Letting 

x — >■ ax, 

y I3y, 

t -)> 7t, 7 > 0, 

(20.6.3) becomes 




y = + hba)xy. (20.6.4) 
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Now we want to choose 7 , (5, and a so that the coefficients of (20.6.4) are 
as simple as possible. Ideally, they would all be unity; we will see that this 
is not possible but that we can come close. 

We will require 

7/3 



= 1 



(20.6.5) 



or 



7 = 



a 



p' 



Equation (20.6.5) fixes 7 . We require a and /3 to have the same signs so 
that stability will not be affected under the rescaling (since 7 scales time). 
Next, we require that 



■jaa 



P 



= 1 . 



Using (20.6.5), (20.6.6) becomes 



aa^ { cP\ 

^ = aal^l = l. 



Equation (20.6.7) fixes a/ p. 

We finally require 

76 a = 1 

but, using (20.6.5), (20.6.8) becomes 

ba^ 

~P 



= bP 



= 1 . 



( 20 . 6 . 6 ) 



(20.6.7) 



( 20 . 6 . 8 ) 



(20.6.9) 



We can see that a and b can have either sign and that a and /3 must 
have the same sign. From (20.6.7) we can further see that a and a must 
have the same sign. Therefore, if (20.6.9) is to hold, we conclude that b and 
a have the same sign. This is too restrictive — the best we can do and still 
retain full generality is to require 



bp 



/3' 



= ± 1 , 



( 20 . 6 . 10 ) 



so that, in the rescaled variables, the normal form is 



x = y, 

y = x'^ ± xy. 



( 20 . 6 . 11 ) 



Step 3: Construct a Candidate for a Versal Deformation. From Example 
20.4.11b, a likely candidate for a versal deformation is 



x = y, 

y = fj,i+ p, 2 y + + bxy, 



6 = ± 1 . 



( 20 . 6 . 12 ) 
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Step 4- Study the Local Dynamics of (20.6.12). We take the case b = -hi. 

Step 40'- Fixed Points and Their Stability. It is easy to see that the fixed 
points of (20.6.12) are given by 

(x,y) = (±y=AB,0). (20.6.13) 

In particular, there are no fixed points for /ii > 0. 

Next we check the stability of these fixed points. 

The Jacobian of the vector field evaluated at the fixed point is given by 

/O 1 \ 0 1 

\2x fj, 2 +xj M2±\/=7b 

The eigenvalues are given by 

Ai,2 = ± ^ (20.6.15) 

If we denote the two branches of fixed points by (x+,0) = (-1-^/— /xi, 0) 
and (x“,0) = (— \/— /xi, 0), we see from (20.6.15) that (x+,0) is a saddle 
for /ii < 0 and all /i 2 , while for /xi = 0 the eigenvalues of (x+jO) are given 

by 

Ai,2 = A^2) 0- 

The fixed point (x~ , 0) is a source for {/X 2 > p,\ < 0} and a sink for 

{M 2 < V— /xi, Ml < 0}; for Ml = 0, the eigenvalues of {x~ , 0) are given by 

Ai,2 = M2) 0 

and, for M 2 = V— Mi> Mi < 0) the eigenvalues on {x~ , 0) are given by 




Ai,2 = ±* 




Thus, we might expect that mi = 0 is a bifurcation curve on which (x^, 0) 
are born in a saddle-node bifurcation and M 2 = \/~Mi) Mi < 0, is a bi- 
furcation curve on which {x~ , 0) undergoes a Poincare- Andronov-Hopf bi- 
furcation. We now turn to verifying this and studying the orbit structure 
associated with these bifurcations. 



Step 4b: The Bifurcations of the Fixed Points. We begin by examining the 
orbit structure near mi = 0, M 2 arbitrary. We will use the center manifold 
theorem (Theorem 18.1.2). 

First we put the system into the “normal form” for the center manifold 
theorem. We treat M 2 as a fixed constant in the problem and think of mi 
as a parameter, and we examine bifurcations from mi = 0. 

To transform (20.6.12) into the form in which the center manifold can 
be applied, we use the following linear transformation 





(20.6.16) 
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which transforms (20.6.12) into 



or 




/O 0 
Vo /X2 





— + (2 + H2)uv + (1 + ^ 2 )^^) \ 

V? + {2 + ^2)wv + {1 + ) 



ii = —— + (2 + ^2)uv + (1 + , 

M2 M2 

V = ^2V + — H [u^ + (2 + ^2)uv + (1 + M2)f^] • (20.6.17) 

M2 M2 



Without actually computing the center manifold, we can argue as follows. 

The center manifold will be given as a graph over u and fii, v(u, /ii), and 
be at least 0(2). Thus, from this we can immediately see that the reduced 
system is given by 

M= - — (/ii +u2) +0(3), (20.6.18) 

M2 

and thus it undergoes a saddle-node bifurcation at fii = 0. 

We can immediately conclude that the stable (unstable) manifold of the 
node connects to the unstable (stable) manifold of the saddle, since this 
always occurs for one-dimensional flows. This points out another advantage 
of the center manifold analysis, since such results are nontrival in dimen- 
sions > 2. 

We next want to examine in more detail the nature of the flow on the 
center manifold and what it implies for the full two-dimensional flow. Recall 
that the eigenvalues of the linearized two-dimensional vector held on the 
bifurcation curve are given by 



Ai,2 — M2) 0. 

In transforming (20.6.12) to (20.6.17), we see from (20.6.16) that the coor- 
dinate axes have be transformed as in Figure 20.6.1. 

Now the flow on the center manifold is given by 

M= - — (/ii +u^) +0(3), (20.6.19) 

M2 

and in the (m, Mi) coordinates appear as in Figure 20.6.2. 

Using the information in Figures 20.6.1 and 20.6.2 and recalling that the 
eigenvalues of the vector held linearized about the fixed point at = 0 
are Ai _2 = M 2 ) 0 > we can easily obtain phase portraits near the origin that 
show the bifurcation in the two-dimensional phase space on crossing the 
/i 2 -axis; see Figure 20.6.3. 




20.6 The Double-Zero Eigenvalue: the Takens-Bogdanov Bifurcation 441 





FIGURE 20.6.1. 




FIGURE 20.6.2. 

In the cases /ii slightly negative, notice the reversals of position for the 
stable and unstable manifolds of the saddle for 1^2 > 0 and /X2 < 0. 
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^2>0. ^t=o 




H2<0. ^1=0 




^2>0. Slightly negative 




^2<0. Slightly negative 




FIGURE 20.6.3. 



We next examine the change of stability of the fixed points {x ~ , 0) on 
Ml < 0- From (20.6.15), the eigenvalues associated with the 
linearization about this curve of fixed points are 



Ai,2 — 



±1 




If we view /i 2 as a parameter, then using (20.6.15) we obtain 



- — Re Ai 2 

dfj,2 



1 

2 



^0. 



Thus, it appears that a Poincare- Andronov-Hopf bifurcation occurs on fj ,2 = 

Next we check the stability of the bifurcating periodic orbits. Recall from 
Theorem 20.2.3 that this involves putting the equation in a certain “normal 
form” and then computing a coefficient, a, which is given by derivatives of 
functions occuring in this normal form. 

First we transform the fixed point to the origin via 



X = X — X 



y = y, 



so that (3.1.198) becomes 



0 1 
-2v/=7ir 0 



0 

xy + x^ 



( 20 . 6 . 20 ) 



Then we put the linear part of (20.6.20) in normal form via the linear 
transformation 

"a;\ _ / 0 1\ ! u 

y)~\ \/-2y=7u 0 



( 20 . 6 . 21 ) 
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under which (20.6.20) becomes 



0 






f u 



Ml 



( 20 . 6 . 22 ) 



0 



Notice that (20.6.23) is exactly in the form of (20.2.13), in which the coef- 
ficient a was given as follows 

— Y 0 {fuuu 4“ fuvv 4“ Quuv 4- 



1 



[fuvifuu 4- fvv^ 9uvi,9uu 4“ 9vv) fuu9uu 4“ fvv9vv] } 



16 i/- 2 y=/cr 

where all partial derivatives are evaluated at the origin. In our case, 

1 



f = uv + 
5 = 0 ; 






Ml 



thus, an easy calculation gives 



a = 



1 

I 61 /-M 1 



> 0 , 



indicating a suhcritical Poincare- Andronov-Hopf bifurcation to an unstable 
periodic orbit below = — /ii. 

This completes the local analysis, and we summarize the results in the 
bifurcation diagram in Figure 20.6.4. 

Step 4c: Global Dynamics. At this stage we have analyzed all possible local 
bifurcations; however, a careful study of Figure 20.6.4 reveals that there 
must be additional bifurcations. This conclusion is based on the following 
facts. 



1. Note the stable and unstable manifolds of the saddle point. For the 

case pL 2 > \/— Ml, Mi < the stable and unstable manifolds have the 
opposite “orientation” compared with the case /12 < 0, < 0. It 

appears as if the manifolds have “passed through each other” as p .2 
decreases. 

2. Using index theory, it is easy to verify that (20.6.12) {b = 4-1) has 
no periodic orbits for /xi > 0 (since there are no fixed points in this 
region). Hence, in traversing the /ii — /i 2 plane in an arc around the 
origin starting on the curve / 1.2 = —Mi &nd ending in > 0 (see 
Figure 20.6.5), we must somehow cross a bifurcation curve (s) which 
results in the annihilation of all periodic orbits. This cannot be a local 
bifurcation, because these have all been taken into account. 
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Saddle-node 







F2 







Step 5: Analysis of Global Bifurcations. We postpone an analysis of the 
global bifurcation in this case until Volume 4 and merely state the result 
for now. 

In this case a likely candidate for the global bifurcation which will com- 
plete the bifurcation diagram is a saddle-connection or homoclinic bifurca- 
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tion. In Chapter 33 we show that this occurs on the curve 

_ 49 2 , 5/2x 

Ml — + Ci(/i2 ), 

which we shown in Figure 20.6.6. From this figure one can see that the 
saddle-connection or homoclinic bifurcation is described by the periodic 
orbit created in the subcritical Poincare- Andronov-Hopf bifurcation grow- 
ing in amplitude as H 2 is decreased until it collides with the saddle point, 
creating a homoclinic orbit. As /i 2 is further decreased, the homoclinic orbit 
breaks. This explains the reversal in orientation of the stable and unstable 
manifolds of the saddle point described above, and it also explains how the 
periodic orbits created in the subcritical Poincare-Andronov-Hopf bifurca- 
tion are destroyed. 




FIGURE 20.6.6. 



Step 6: Ejfects of Higher Order Terms in the Normal Form. Takens [1974] 
and Bogdanov [1975] proved that the dynamics of (20.6.12) are not qual- 
itatively changed by the higher order terms in the normal form. Hence, 
(20.6.12) is a versal deformation. We will discuss the issues involved with 
proving this more thoroughly in Chapter 33. For this reason, the bifur- 
cation associated with the non-semisimple souble zero eigenvalue is often 
referred to as the Takens-Bogdanov bifurcation. 

This completes our analysis of the case 6 = 4-1; the case 6 = — 1 is 
very similar, so we leave it as an exercise. Before leaving the double-zero 
eigenvalue, we want to make some final remarks. 

Remark 1. The reader should note the generality of this analysis. The nor- 
mal form is completely determined by the structure of the linear part of 
the vector field. 
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Remark 2. Global dynamics arose from a local bifurcation analysis. For 
two-dimensional vector fields these dynamics cannot be very complicated, 
but for three-dimensional vector fields chaotic dynamics may occur. 



Remark 3. When one speaks of the “double-zero eigenvalue” for vector 
fields, one usually means a vector field whose linear part (in Jordan canon- 
ical form) is given by 




However, the linear part 




is also a double-zero eigenvalue. This case is codimension 4 and is conse- 
quently more difficult to analyze. 



20.6a Additional References and Applications for 

THE TAKENS-BOGDANOV BIFURCATION 

The Takens-Bogdanov bifurcation arises in a variety of applications and 
is still a current topic of research. Recent references are Kertesz [2000], 
Belhaq et al. [2000], Algaba et al. [1998], [1999a,b,c,d], Batiste et al. [1999], 
Renardy et al. [1999], Champneys et al. [1999], Ramanan et al. [1999], 
Nikolaev et al. [1999], Degtiarev et al. [1998], Needham and McAllister 
[1998], Skeldon and Moroz [1998], Tracy and Tang [1998], Tracy et al. 
[1998], Kertesz [1997], Golubitsky et al. [1997], and Labate et al. [1997]. 



20.6b Exercises 

1. The Double-Zero Eigenvalue with Symmetry. Consider a (r as large as neces- 
sary) vector field on having a fixed point at which the matrix associated with the 
linearization has the following Jordan canonical form 

(s ;)■ 

Let (fc, y) denote coordinates on and suppose further that the vector field is equiv- 
ariant under the coordinate transformation 

{x, y) i-A (-a:, -y). 

This exercise is concerned with the bifurcations near such a degenerate fixed point. 

a) Show that a normal form for this vector field near this nonhyperbolic fixed point 
is given by 

X - y-hO(5), 
y — ax^ -\- bx^y 0(5). 

b) Following the procedure described earlier, show that a candidate for a versal 
deformation is given by 

X — y + 0(5), 

y — fiix y. 2 y + + bx^y + 0(5). 
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FIGURE 20.6.7. 



In the following we will be concerned with the dynamics of the truncated normal form 

X ^ y, 

y = y.\x -I- y. 2 V -I- ax^ + bx'^y. 



c) 



Show that by rescaling, the number of cases to be considered can be reduced to 
the following 



X ^ y, 

y — y.ix + fi 2 y + cx^ - x^y, 



(20.6.23) 



where c = ±1. 




C=-1 



FIGURE 20.6.8. 

d) For ij,i — fj ,2 — 0, show that the flow near the origin appears as in Figure 20.6.7 
for c = +1 and as in Figure 20.6.8 for c — —1. 

e) Show that (20.6.23) has the following fixed points 

c ^ +1 : (0,0), (±U^7U,0), 

-1 : (0,0), (±0ir,O). 
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f) Compute the linearized stability for the fixed points for both c = +1 and c = — 1 
and show that the following bifurcations occur. 

c = +1 : pitchfork on = 0, 

supercritical Poincare— Andronov-Hopf 
on <0, fi 2 — 0. 

c = — 1 : pitchfork on = 0, 

subcritical Poincare— Andronov— Hopf 
on fii — fi 2 , Ml > 0- 

g) Show that (20.6.23) has no periodic orbits for 

c — +1 : > 0; 

<0, M2 < 0; 

M2 > -Ml/5, Ml < 0- 

c — — 1 : ^2 < 0. 

{Hint: use Bendixson’s criterion and index theory.) 

h) Use the results obtained in d) —¥ e) and completely justify the local bifurcation 
diagrams shown in Figure 20.6.7 for c = +1 and in Figure 20.6.8 for c — —1. 

i) Based on an examination of Figures 20.6.7 and 20.6.8, can you infer the necessity 
of the existence of global bifurcations? What scenarios are most likely? 

We will return to this exercise in Chapter 33 to study possible global bifurcations in 
more detail. 



2 . 



The averaged equations for the forced van der Pol oscillator (See Holmes and Rand 
[1978]) are given by: 



u = u — (7V — u{u^ + v^), 

V — (Tu + u — v{u^ + v^) — 7. 



(20.6.24) 



Consider the bifurcation diagram in Figure 20.6.9. 



The object of this exercise is to derive the bifurcation diagram. 



a) Show that (20.6.24) has a single fixed point in regions I and III (a sink in I, a 
source in III). Show that in region II there are two sinks and a saddle, and in 
region IVa U IVb there is a sink, a saddle, and a source. 

b) Show that (20.6.24) undergoes a saddle-node bifurcation on 



— (l-b9<r^)-h — {l + cr'^f = 0 . 

4 27 ^27 ’ 



This is the curve DAC marked Bs in Figure 20.6.9. 

c) Show that (20.6.24) undergoes a Poincare— Andronov— Hopf bifurcation on 



87 ^ — 4cr^ 4-1, |cr| > -. 

This is the curve OE marked Bh in Figure 20.6.9. 

d) In Figure 20.6.9, consider the broken lines crossing the curves OA, OD, 

AB, BE, and OB. Draw phase portraits representing the fiow on and to each 
side of the indicated curve; see the example in Figure 20.6.9. 

e) OS is a curve on which homoclinic orbits occur (sometimes called saddle con- 
nections). Give an intuitive argument as to why such a curve should exist. (Is it 
obvious that it should be a smooth curve?) 

f) Discuss the nature of (20.6.24) near the points A, O, and C. 

g) (20.6.24) is an autonomous equation whose flow gives an approximation to the 
Poincare map of the original forced van der Pol equation in a sense made precise 
by the averaging theorem. Using the previously obtained results, interpret Parts 
i) — >• vi) in terms of the dynamics of the original forced van der Pol equation. In 
particular, list the structurally stable motions and bifurcations along with the 
structurally unstable bifurcations. 



If you need help you may consult Holmes and Rand [1978], where these results were 
first worked out. 
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Region I, Sink On Bj, Sink & Region II 

Saddle-node (2 Sinks, Saddle) 



FIGURE 20.6.9. 

20.7 A Zero and a Pure Imaginary Pair of 
Eigenvalues: the Hopf-Steady State 
Bifurcation 

Suppose that the linear part of the vector field (after a possible center 
manifold reduction) has the following form 

/O -u 0\ / x\ 

U 0 0 y . (20.7.1) 

VO 0 Oj \z) 

The bifurcation associated with this nonhyperbolic equilibrium point is 
sometimes referred to as the Hopf-steady state bifurcation, since it is a 
combination of a Hopf bifurcation (i.e, there is a pure imaginary pair of 
eigenvalues) and a steady state bifurcation (i.e., a zero eigenvalue associated 
with a bifurcation of an equilibrium point, or “steady state”). 

The normal form for this case is given by: 

r = airz + a 2 p + asrz"^ + 0(\r\^, |z A, 
z = bir^ + b 2 Z^ + b^r^z + 0(|r|h \z\^), 

0 = uj + ciz + 0{\rf ,\z\‘^), 



(20.7.2) 
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(see Exercise 5 in Section 19.4 in Chapter 19). This is the vector field we 
will study. Notice that the 0-dependence in the r and z components of 
the vector field can be removed to order k for k arbitrarily large (note: 
exactly the same thing occurred when analyzing the normal form for the 
Poincare- Andronov-Hopf bifurcation). This is important, because it is a 
major tool in facilitating the analysis of this system. Specifically, recall 
that our analysis is only local (i.e., r, z sufficiently small), so that we have, 
for r, z sufficiently small, 0 yf 0. Thus, we will truncate our equation at 
some order and, ignoring the 0 part of our vector field, perform a phase 
plane analysis on the r, z part of the vector field. For r, z sufficiently small, 
in some sense (to be made precise later) the r — z phase plane can be 
thought of as a Poincare map for the full three-dimensional system. Also, 
we must consider the effects of higher order terms on our analysis, since it 
is not necessarily true that in the actual vector field the (r, z) components 
are independent of 0; we only push the 0-dependence up to higher order 
with the method of normal forms. 

Our analysis will follow the same steps as our analysis of the double-zero 
eigenvalue in Section 20.6. 

Step 1: Compute and Truncate the Normal Form. The normal form is given 
by (20.7.2). For now we will neglect terms of 0(3) and higher and, as 
described above, the 0 component of (20.7.2). Thus, the vector field we will 
study is 



r = oirz, 

z = bir"^ + b 2 z'^ . (20.7.3) 



Step 2: Rescaling to Reduce the Number of Cases. Rescaling by letting 
f = ar and z = /3z, we obtain 



Oi 



aP\ 



2 = (3 



bl _2 02 _2 

2^ + 



a 



Now, letting j3 = —02, a = — a/|0i 02|, and dropping the bars on r,z, we 
obtain 

ai 

r = ——rz, 

02 

— 0102 2 2 

^ = irTT’" “ ^ ’ 

0102 



or 



f = arz, 
z = br'^ - z^. 



(20.7.4) 
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where a = is arbitrary (except that it is nonzero and bounded) and 
b=±l. 

Next we want to determine the topologically distinct phase portraits of 
(20.7.4) which occur for the various choices of a and b. We will find that 
there are six different types, which (following Guckenheimer and Holmes 
[1983]) we label I, Ila, Ilb, III, IVa, IVb, because the versal deformation of 
Ila and lib as well as the versal deformation of IVa and IVb are essentially 
the same. 

The key idea in determining these classifications involves finding certain 
invariant lines (separatrices) for the flow given hy z = kr (note that r > 0). 
Substituting this into our equation gives 

dz ^ br'^ — k^r'^ b — k^ 
dr akr'^ ak 

or 

t = (20.7.5) 

hence, the condition for such invariant lines to exist is > 0. 

Note that r = 0 is always invariant, and the equation is invariant under 
the transformation z — >■ —z, t — >■ —t. 

Therefore, for 6 = 1 there are two distinct cases 

a < —1, a > —1, 

and, for 6 = — 1, there are two distinct cases 

a < —1, a > —1. 

The direction of the flow on these invariant lines can be calculated by 
taking the dot product of the vector held with a radial vector held evaluated 
on the invariant line. 



s = {arz,br'^ - z'^) ■ {r,z)\^^f^^ 

= r^k{a + b — k^). (20.7.6) 

Substituting (20.7.5) into (20.7.6) (and taking the ‘+’ sign in (20.7.5), 
which will give the direction of flow along z = kr in the first quadrant) 
gives 

9 

(IT Z 

s=- (a + &+l). (20.7.7) 

If this quantity s is > 0 (take z,k > 0), then the flow is directed radially 
outward for z > 0. If s < 0, then the flow is directed inward for z > 0. The 
opposite case occurs for z, fc < 0. We summarize this information below. 

b = +1, a < —1. There are no invariant lines except r = 0. 
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b= +1, a > —1. From (20.7.5) we see that, in this case, we do have an 
invariant line and, from (20.7.7), that the direction of flow along this line 
is governed by the sign of 

a 

1 Q 

Hence, we have two cases 

— >0 for a > 0, 

1 + a 

— <0 for — 1 < a < 0. 

1 + 0 

We will not consider the degenerate o = 0 case, since this will necessitate 
the consideration of higher order terms in the normal form. 

b = —1, o>— 1. There are no invariant lines except r = 0. 

b = —1, a < —1. From (20.7.5) and (20.7.7) we see that, in this case, we 
do have an invariant line with s < 0. 




I:b=+1,a>0 Ha: b=+1,ae(1,0) Mb: b-+1, a <-1 




III: b=-1 , a>0 IVa: b=-1 , ae[-1 ,0) IVb: b=-1 , a<-1 



FIGURE 20.7.1. 

We summarize the information obtained concerning the orbit structure 
of (20.7.4) in Figure 20.7.1. There are thus six topologically distinct cases. 
However, we are still not quite finished with their phase portraits. Notice 
that (20.7.4) has the first integral 

/(r,z)=^r^/“[^-z^]. (20.7.8) 
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The reader can check this by showing 



dl . dl . ^ 

or oz 



where f and z are obtained from (20.7.4). 




1: b=+1, a>0 



FIGURE 20.7.2. 



Now this first integral can give us information concerning whether or 
not there are closed orbits in our phase portraits and, of course, the level 
curves give us all the trajectories. We will thus examine the level curves of 
/(r, z) for each of our six cases. Also, we will only analyze the r > 0, z > 0 
quadrant of the (r, z) plane, since knowledge of the flow in this quadrant 
is sufficient due to the symmetry z — >■ — z, t — >■ —t. 

Case I . We begin with Case I for which we have b = +1 and a > 0, which 
implies that k = ^ < 1. 

Recall that, in this case, the vector field is given by 



r = arz, 



from which we see that, in r > 0, z > 0, we have 

r > 0 r is increasing on orbits, 

z = 0 on the line r = z. 

For z > 0, z below the line r = z, we have z > 0, which implies that z is 
increasing on orbits with initial z values below the line r = z. The opposite 
conclusion holds for orbits starting above r = z. Also, since the line z = kr 
is invariant and thus cannot be crossed by trajectories, and since z = kr 
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lies below z = r, we conclude that trajectories below the line z = kr must 
have z and r components increasing montonically. These observations allow 
us to sketch the phase portrait shown in Figure 20.7.2. 



Case Ila . We have b = +1, a G (—1, 0), which implies k 




> 1 . 



1. In this case the line z = r lies below the invariant line z = kr; 
therefore, the only place where z can vanish (besides the origin) is 
below z = kr. 

2. Also, due to the fact that a £ (—1,0), in the quadrant r > 0, z > 0, 
we always have r < 0; hence, r is always decreasing on orbits. 




z=kr 



z=r 



z=-kr 



FIGURE 20.7.3. 



Now we will consider our first integral 



I(r,z) 



“^2/a 



1 -t“ u 



The level curves of this function are trajectories of (20.7.4). The following 
lemma will prove useful. 

Lemma 20.7.1 A level curve of I(r,z) may intersect the line z = r only 
once in r > 0, z > 0. 

Proof: At this stage of our analysis of Case Ila the orbit structure shown 
in Figure 20.7.3 has been verified. Note that at the point (r, 0) shown in 
Figure 20.7.3 we have 



z = r^ 



r = 0, 



at (r,0). 
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By the comments 1 and 2 above, since r < 0 everywhere in z > 0, r > 0, 
and i > 0 at (r, 0), the trajectory starting at (r, 0) must eventually cross 
the line z = r. 

Now the trajectory starting at (r, 0) lies on the level curve given by 



It intersects the line z = r, and we can compute the r coordinate of the 
intersection as follows on z = r 



I{r,r) = c = 



2 



2 



1 -t“ u 

9 

ar^ 

1 -|- (i 



r^(l + a) 

1 -j- Cl 

= ^2+(2/g) 

2(1 + a) 



(20.7.10) 



z 




FIGURE 20.7.4. 

We can compute the r coordinate of the intersection point in terms of 
the starting point f by equating (20.7.9) and (20.7.10) 



2(1 + a) 



r2+(2/a) _ 



l2f2+(2/a) 

2(1 + a) 



and solving for 

/ \ (2a+2) 

r = I I r. 

V aj 

Thus, given (r, 0) as an initial condition for a trajectory, we conclude 
that this trajectory can intersect the line z = r only at the unique value 
of r given above (and hence also at a unique value of z). This proves the 
lemma. □ 
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This lemma therefore tells us that once a trajectory crosses the z = r 
line it is forever trapped between the lines z = r, z = kr and, since f < 0, 
it must approach (0, 0) asymptotically. Putting together these results and 
using the symmetry z — >■ — z, t — >■ —t, we obtain the phase portrait for Case 
Ila shown in Figure 20.7.4. 

Case Ilb . There are no invariant lines in this case; however, the arguments 
given in Case Ila can be slightly modified to yield the phase portrait shown 
in Figure 20.7.5. 



z 




FIGURE 20.7.5. 



We now proceed to the b = —1 cases. In these cases z is always decreasing. 

Cases III and IVa . These cases are easy since there are no invariant lines; 
hence, there is no additional orbit structure beyond that shown in Figure 
20.7.1 (note: the reader should verify the different “dimples” exhibited by 
phase curves in these figures upon crossing the r-axis). 

Case IVb. We have 6 = — 1 and a < — 1 with k = Since z is 

Y l+a 

decreasing and r is decreasing we have the phase portrait shown in Figure 

20.7.6. This completes the classification of the possible local phase portraits. 

We now show them together in Figure 20.7.7 for comparative purposes. 

Step 3: Construct a Candidate for a Versal Deformation. From Section 

20.4 and, in particular. Example 20.5.6 from Section 20.5, a candidate for 

a versal deformation is given by 

f = p,ir + arz, 

z = fj .2 + br^ — z“^ , b = ±1. (20.7.11) 



Step 4- Study the Local Dynamics of (20.7.11). 
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z 




IVb: b=-1 , a<-1 

FIGURE 20.7.6. 




I;b-+1,a>0 lla:b=+1,ae(1,0) lib: b=+1, a <-1 




III: b=-1 , a>0 IVa: b=-1 ,ae [-1 ,0) IVb: b=-1 , a<-1 

FIGURE 20.7.7. 



Step 4^a. Fixed Points and Their Stability. It is easy to see that there are 
three branches of fixed points of (20.7.11) given by 

(r,z) = 

(note: r > 0). 

We next examine the stability of these fixed points. The matrix associ- 
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ated with the linearized vector field is given by 

■'=('“ 2 ;“ -L)- 

Before analyzing the stability of each branch of fixed points, we want to 
work out a general result that will save much time. 

The eigenvalues of (20.7.13) are given by 

Ai, 2 = ^ ± ^ V(trJ)"-4det J. (20.7.14) 

Using (20.7.14), we notice the following facts 



if 


tr 


J 


> 


0, 


det 


J 


> 


0, 


then 


Ai 


> 


0, A 2 > 


0 ^ 


source. 


if 


tr 


J 


> 


0, 


det 


J 


< 


0, 


then 


Ai 


> 


0, A 2 < 


0 ^ 


saddle. 


if 


tr 


J 


< 


0, 


det 


J 


> 


0, 


then 


Ai 


< 


0, A 2 < 


0 


sink. 


if 


tr 


J 


< 


0, 


det 


J 


< 


0, 


then 


Ai 


> 


0, A 2 < 


0 


saddle. 


if 


tr 


J 


= 


0, 


det 


J 


< 


0, 


then 


Ai, 


2 = 


= ±\/ det J 


saddle. 


if 


tr 


J 


= 


0, 


det 


J 


> 


0, 


then 


Ai, 


2 = 


= ±*vT 


detJ ,=^ center. 


if 


tr 


J 


> 


0, 


det 


J 


= 


0, 


then 


Ai 


= 


tr J, A 2 


= 0, 




if 


tr 


J 


< 


0, 


det 


J 


= 


0, 


then 


Ai 


= 


tr J, A 2 


= 0. 


(20, 



We now analyze each branch of fixed points individually. 
(0, y/jL^)- On this branch of fixed points we have 

tr J = (/ii + ay/Jja) - 2v7^, 
det J = -2y/jL^{ni + ay^), 

from which we conclude 



tr J > 0 


Ml 


tr J < 0 


Ml 


det J > 0 ^ 


Ml 


det J <0 ^ 


Ml 



+ ay^ > 2y^Jl2, 
+ ayfiX2 < 2y/Jl2, 
+ ay^ < 0 , 

+ 0,y^Jl2 > 0. 



Appealing to (20.7.14) and using (20.7.15), we make the following con- 
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elusions concerning stability of the branch of fixed points (0, 
tr J >0, det J > 0, cannot occur, 

if tr J > 0, det J < 0, then /xi + saddle, 

if tr J < 0, det J > 0, then /xi + < 0 sink, 

if tr J < 0, det J < 0, then 0 < + ay/jL 2 < 

if tr J = 0, det J < 0, then ~ 2^^ ^ saddle, 

’ ’ Ml + aV/^ > 0 

tr J = 0, det J > 0, cannot occur, 

if tr J > 0, det J = 0, M2 = 0) Mi > 0 bifurcation, 

if tr J < 0, det J = 0, then /xi + a^JJl 2 = 0 or ^2 = 0, 

Ml < 0 ^ bifurcation. 

Thus, (0, ^/JI 2 ) is a 

sink for fii + a^/JI^ < 0, 

saddle for fii + > 0. 

Later we will examine the nature of the bifurcation occurring on /ii + 

= 0 and M2 = 0- 

Next we examine the branch (0, 

(0, —y/Jii). On this branch we have 

tr J = /ii - + 2^/JI^, 

det J = 2 y^(/xi - ay/JI^), 

from which we conclude 

tr J > 0 Ml ~ > —2y/JI^, 

tr J < 0 Ml ~ < —2y/JI^, 

det J > 0 Ml ~ > 0, 

det J < 0 Ml ~ a-\/^ < d. 

Appealing to (20.7.14) and using (20.7.15), we make the following con- 
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elusions concerning the stability of the branch of fixed points (0, — y^): 
if tr J > 0, det J > 0, then /xi — a^/Jl 2 > 0 source, 

if tr J > 0, det J < 0, then 0 > ^ saddle, 

tr J <0, det J > 0, cannot occur, 

if tr J < 0, det J < 0, then /xi — ay/jL 2 < — 2y/^ saddle, 

tr J = 0, det J > 0 cannot occur, 

if tr J = 0, det J < 0, then /xi — saddle, 

if tr J > 0, det J = 0, then /xi = ay/Ji 2 ^ bifurcation, 



tr J <0, det J = 0 cannot occur. 

We thus conclude that (0, — > 0) has the following stability character- 
istics. 

source for fii — > 0, 

saddle for ni — < 0. 

Later we will examine the bifurcation occuring on /ii — Oy/^ = 0. 

Now we turn to an examination of the remaining branch of fixed points 
(note that our previous analysis did not depend on h) 




We examine the cases 6 = -1-1 and h = —1 separately. 



b = -1-1, { \ ^ — 4 i 2 , — ^ 1 • This branch exists only for ^ and on 
this branch we have 





(20.7.16) 

(20.7.17) 



We now must consider Cases I and Ila and b individually. 

Case I: g > 0 . From (20.7.17), for a > 0 we have det J < 0. Using (20.7.15), 
for det J < 0, the fixed point is always a saddle. 
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Cases IIa,b: a < 0. From (20.7.17), for o < 0 we have det J > 0. Hence, 
using (20.7.15), we conclude the following 






/r-i > 0, ^ > 0 



source. 






< 0, ^ — /X 2 > 0 sink. 



We might guess that on = 0, /X 2 < 0, a Poincare- Andronov-Hopf bifur- 
cation occurs. 

Next we examine the case 6 = — 1. 



b=-l, 



) • For this case we have 



tr J = 



2^1 



det J = 2a I /i 2 y 



Ml 



(20.7.18) 

(20.7.19) 



(Note that ^2 y > 0.) 

We will examine Cases III and IVa and b individually. 

Case III: a > 0 . Using (20.7.19), it follows that det J > 0 which, when used 
with (20.7.18) and (20.7.15), allows us to conclude that 

> 0 source, 

/ii < 0 sink. 




FIGURE 20.7.8. 
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Hence, we might guess that a Poincare-Andronov-Hopf bifurcation is 
possible on /n = 0, /i 2 > 0. 

Cases IVa,b: a < 0. Using (20.7.19), we see that det J < 0, which, when 
used with (20.7.18) and (20.7.15), allows us to conclude that 

< 0 saddle, 

> 0 saddle. 

Hence, no Poincare-Andronov-Hopf bifurcations occur. 

This completes the stability analysis of the fixed points. We next examine 
the nature of the various possible bifurcations. 

Step 4b- The Bifurcations of the Fixed Points. First we examine the two 
branches 

(0,±v^) ■ 

These branches exist only for p .2 > 0, coalescing at p .2 = 0. We thus ex- 
pect them to bifurcate from (0, 0) in a saddle-node bifurcation. Since these 
branches start on r = 0 and remain on r = 0, the center manifold analysis 
is particulary simple — we simply set r = 0 in our original equations and 
obtain 

Z = pL2 — Z^. 

(Note that the equation is independent of b with bifurcation diagram shown 
in Figure 20.7.8.) 

Next we examine the bifurcation of the branches (0,±y^), /i 2 > 0, 
which occurs on pL\ ± Qy/JIf = 0. 

We will do a center manifold analysis. First we transform the fixed point 
to the origin. 

Letting f = z^ y/hf, (20.7.11) becomes 
f = p,ir + ar{f ± 

^ - (e ± ^2?, h2 > 0. (20.7.20) 

We are interested in the flow in a neighborhood of the curve fXi ± = 

0; we illustrate this curve in the (/xi — /i 2 )-parameter plane in Figure 20.7.9. 

Therefore, we will set /xi = constant and ^/Jlf — ~ This corre- 

sponds to crossing the parabola vertically for fixed We will have to pay 
close attention to the direction in which we cross the curve by varying e; 
we will come back to this later. Substituting ^ into (20.7.20) 

gives 



r = /XiT -I- or ± ^ 



hi 

T— -e 

a 



= T e 

a 
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FIGURE 20.7.9. 



or 



f = ar^ =F are, 

i = 2[^±s)^ + br^-e-, 

in matrix form (including the parameters as a dependent variable in antic- 
ipation of applying center manifold theory), it gives 

f r \ _ f 0 0 \ , f ar^ T are \ 

e = 0. (20.7.21) 

Fortunately, (20.7.21) is already in the standard form for application of 
the center manifold theorem. From Theorem 18.1.2, the center manifold 
can be represented as follows 

= { (g £, a I ? = h(r, e), h(0, 0) = 0; Dh(0, 0) = 0} 



for r and e sufficiently small where h satisfies 

Dh{x) [Bx + f{x, h{x))] — Ch{x) — g{x, h{x)) = 0, 

with 



(20.7.22) 



/ = 



X = (r,e), 

ar^ =F are 

0 



B = 



0 0 
0 0 



c = 



2/xi 



g = ±2e^ -I- 6r^ — 



Substituting /i(r, e) = ar^ -|- /3re -I- ye^ -f 0(3) into (20.7.22) gives 



(2ar + f3e + 0{3),f3r + 2ye H- 0(3)) 



arh =F are 
0 
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— + /3re + 7 ^^ + 0(3)) — {±2eh + br"^ — h^) = 0. 

a 

(20.7.23) 
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and, therefore, the vector field (3.1.227) restricted to the center manifold 
is given by 

r = ar ( + £^(3)^ T ore 

V 2/xi / 



or 



f = r 




T ae + 



(20.7.24) 




|a<0, iii<0, b=+lj 




e 




FIGURE 20.7.11. Bifurcations on the center manifold for (0, 

This equation indicates that pitchfork bifurcations occur at £ = 0, but 
note that, for us, the only bifurcating solution that has meaning is the 
r > 0 solution. 

We now use (20.7.24) to derive bifurcation diagrams for each branch of 
fixed points. 

{Q,+y/Ji 2 )- The bifurcation curve is given by 

Ml 

5 

a 
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and e increasing from negative to positive corresponds to decreasing 
across the bifurcation curve. 

The vector field restricted to the center manifold is 

( 2 \ 
r = r — - — r — ae ] + ■ ■ ■ , 

V 2/ri J 



from which we easily obtain the bifurcation diagrams shown in Figure 
20.7.10. 

(0, The bifurcation curve is given by 



— • 

a 

The vector field restricted to the center manifold is 

ae 



r = r \ -- — r 
2/ti 



from which we easily obtain the bifurcation diagrams shown in Figure 
20.7.11. 

Now we want to translate these center manifold pictures into phase por- 
traits for the two-dimensional flows. Recall that the eigenvalues for the 
direction normal to the center manifold is given by . We draw sepa- 
rate diagrams for each branch in Figures 20.7.12 and 20.7.13. 

At this point we will summarize our results thus far. 

A saddle-node bifurcation occurs at /X 2 = 0, giving us two branches of 
fixed points (0,±^^). 

1. (0, undergoes a pitchfork bifurcation on -|- = 0 with 

a new fixed point being born above the curve for b = —1 and below 
the curve for h = -\-l. 

2. (0, —y/jL 2 ) undergoes a pitchfork bifurcation on yf]P 2 ~ ^ = 0 with 
a new fixed point being born above the curve for b = —1 and below 
the curve for b = -\-l. 



Detailed stability diagrams can be obtained from the previously given 
diagrams. 

To complete the local analysis we must examine the nature of the possible 
PoincarAAndronov-Hopf bifurcations in Cases IIa,b and III. 

We examine each case individually. 

Cases IIa,b. The branch of fixed points is given by 



(r, z) = 




a 



(20.7.25) 
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FIGURE 20.7.12. Bifurcations in the r — ^ plane for (0, +y7t2). 



where a < 0 and -^ > U 2 - 

Our candidate for the Poincare- Andronov-Hopf bifurcation curve is given 

by 

Ml = M2 < 0. 

On this curve the eigenvalues of the vector field linearized about a fixed 
point on the branch (20.7.25) are 

Ai_ 2 = ±t\/|2aM2|- 



The reader should recall from Theorem 20.2.3 that there are two quan- 
tities which need to be determined. 



1. The eigenvalues cross the imaginary axis transversely. 
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FIGURE 20.7.13. Bifurcations in the r — ^ plane for (0, —y/JIi)- 



2. The coefficient a in the Poincare-Andronov-Hopf normal form is 
nonzero. (Note: this should not be confused with the coefficient a 
in the normal form that we are presently studying, which we have 
assumed to be nonzero.) 



We begin by verifying statement 1. 

The general expression for the eigenvalues is 



fy J 1 

Ai ,2 = ^ ± -V(tr J)2 - 4det J, 



where in our case 





det J = —2a 
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We will view as fixed and /xi as a parameter; since we are interested 
only in the behavior of the eigenvalues near /ii = 0, /i 2 < 0, we can take 
the real part of Ai ,2 to be and thus obtain 

— — Re A = — yf 0. 
dfii a 

Next we check statement 2. 

We set 6 = +1 in (20.7.11) and obtain 

r = /iir + arz, 

z = ^i 2 + r^ - 

We next put this system into the normal form so that the coefficient a 
in the Poincare-Andronov-Hopf normal form can be computed. First we 
translate the fixed point to the origin by letting 



and hence obtain 





We next evaluate this equation on the bifurcation curve /ii = 0, /i 2 < 0 
and get 

P = a^/\p^\^ + ap^, 

The matrix associated with the linear part of this equation is given by 

/ 0 a^/\^\\ 

\2y\p2\ 0 )■ 

Introducing the linear tranformation 
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the equation becomes 




f ^ 

\V\2a1I2\ 



-i/|2o/r2| 






This is the standard form given in (20.2.13) from which the coefficient a in 
the Poincare- Andronov-Hopf normal form can be computed. 

From (20.2.14), this coefficient is given by 



1 

16 



[fuuu fuvv 9u 



16 >/| 2 a^ 



\fuv{^fuu fvv^ 



9uv (5 uu 9vv) fuuduu f vvQvv\^ 

and all partial derivatives are evaluated at (0, 0). 

In our case 



g = -|a|MU. 



Now we work out the partial derivatives (note all third derivatives vanish) 

fuu — 2 , guu — Oj 

fuv — Oj 9uv — |a|, 

fvv — |a|, Qvv — 0 - 

Thus, the coefficient a is identically zero. This tells us that we must retain 
(at least) cubic terms in our normal form in order to get any stability 
information concerning the Poincare-Andronov-Hopf bifurcation. (Note: we 
might have guessed that this would be the case. Why?) We will complete 
this analysis in Chapter 33. 

Now we must examine Poincare-Andronov-Hopf bifurcations in the re- 
maining case. 

Case III, a > 0. It is straightforward to verify that Poincare-Andronov- 
Hopf bifurcations occur on /xi = 0, /i 2 > 0 for the branch of fixed points 
given by 




and that, unfortunately, in this case also, the coefficient in the Poincare- 
Andronov-Hopf normal form is identically zero. 

We now want to summarize these results in the following bifurcation 
diagrams. 
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FIGURE 20.7.14. Case I: 6 = +1, a > 0. 

Case I: 6 = +1, a > 0. In Figure 20.7.14 we show phase portraits for differ- 
ent regions in the fii — fj ,2 plane. Note that by index theory there can be no 
periodic orbits in Case I (which might arise via some global bifurcation). 
This is because we must have r > 0, and the only fixed point in r > 0 is a 
saddle point. Thus, Figure 20.7.14 represents the complete story for Case 
L It remains only to interpret the r — z phase plane results in terms of 
the full three-dimensional vector field and consider the effects of the higher 
order terms of the normal form. We will do this later in this section. 

Case IIa,b: b = -1-1, a > 0. We show phase portraits for this case in differ- 
ent regions of the /Xi — ^2 plane in Figure 20.7.15. Note that the eigen- 
val ues of th e matrix associated with the vector field linearized about 

are given by 

A1.2 = — ± J ^ + -(m? - a'^1^2), 
a \ a 

and that these eigenvalues have nonzero imaginary part for 
1^2 < Ml ^2 -I- /2a^, fj,2 < 0. 

This gives us a better idea of the local orbit structure near these fixed 
points, and we illustrate this curve with a dotted line in Figure 20.7.15. We 
caution, however, that it is not a bifurcation curve. 
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FIGURE 20.7.15. Case IIa,b: b — +1, a > 0. 

Note that on /ri = 0 the truncated normal form has the first integral 

F{r,z) = ^r^/^ + ■ (20.7.26) 

This should give some insight into the “degenerate” Poincare- Andronov- 
Hopf bifurcation, since (20.7.26) implies that on /n = 0 the truncated 
normal form has a one-parameter family of periodic orbits. We expect that 
this degenerate situation will dramatically change when the effects of the 
higher order terms in the normal form are taken into account. In particu- 
lar, we would expect that a finite number of these periodic orbits survive. 
Exactly how many is a delicate issue that we will examine in Chapter 33. 

Case III: b = —I, a > 0. We show phase portraits in different regions of the 
Ml ~ M 2 plane in Figure 20.7.16. This case suffers from many of the same 
difficulties of Cases IIa,b. In particular, the truncated normal form has the 
first integral 

G(r, z) = (20.7.27) 

on /ii = 0. An examination of the first integral shows that, for /i 2 > 0, 
the truncated normal form has a one-parameter family of periodic orbits 
which limit on a heteroclinic cycle as shown in Figure 20.7.16. In Chapter 
33 we will consider the effects of the higher order terms in the normal form 
on this degenerate phase portrait. 
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Case IVa,b: b = —1, a < 0. We show phase portraits in the different regions 
in the /ii — /t 2 plane in Figure 20.7.17. Using index theory, it is easy to argue 
that these cases have no periodic orbits. Hence, Figure 20.7.17 represents 
the complete story for the r—z phase plane analysis of the truncated normal 
form. 




FIGURE 20.7.18. a) From Case I, b) From Case IVa,b. 

Relation of the Dynamics in the r — z Phase Plane to the Full 
Three-Dimensional Vector Field 

We now want to discuss how the dynamics of 

r = nir + arz, 

z = n^ + br"^ - z"^, (20.7.28) 

relate to 

r = nir + arz, 
z = yt 2 + br‘^ - z^ , 

d = w + ---. (20.7.29) 

We are interested in three types of invariant sets studied in (3.1.233). They 
are fixed points, periodic orbits, and heteroclinic cycles. We consider each 
case separately. 
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Fixed Points 

There are two cases, fixed points with r = 0 and fixed points with r > 0. It 
is easy to see (returning to the definition of r and 9 in terms of the original 
Cartesian coordinates) that fixed points of (20.7.28) with r = 0 correspond 
to fixed points of (20.7.29). Hyperbolic fixed points of (20.7.28) with r > 
0 correspond to periodic orbits of (20.7.29). This follows immediately by 
applying the method of averaging. See Figure 20.7.18 for a geometrical 
description. 

Periodic Orbits 

We have not developed the theoretical tools to treat this situation rigor- 
ously; this will be done in Volume 3. However, for now we will give a heuris- 
tic description of what is happening. Notice that the r and z components 
of (20.7.29) are independent of 9. This implies that the periodic orbit in the 
r — z plane is manifested as an invariant two-torus in the r — z — 9 phase 
space; see Figure 20.7.19. This is a very delicate situation regarding the 
higher order terms in the normal form, since they could dramatically affect 
the flow on the torus, in particular, whether or not we get quasiperiodic 
motion or phase locking (periodic motion). 




(a) 

3-d Flow 



Poincare Section 




FIGURE 20.7.19. a) Three-dimensional flow, b) Poincare section. 
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Heteroclinic Cycles 

Following the discussion for periodic orbits given above, the part of the 
heteroclinic cycle in Case III (^2 > 0) on the z axis is manifested as an in- 
variant line in r — z — 9 space, and the part of the heteroclinic cycle having 
r > 0 is manifested as an invariant sphere in r — z — 9 space; see Fig- 
ure 20.7.20. This is a very degenerate situation and could be dramatically 
affected by the higher order terms of the normal form. 

Step 5: Analysis of Global Bifurcations. As we have mentioned, this will be 
completed in Chapter 33 after we have developed the necessary theoretical 
tools. 

Step 6: Effects of the Higher Order Terms in the Normal Form. In Cases I 
and IVa,b the method of averaging essentially enables us to conclude that 
the higher order terms do not qualitatively change the dynamics. Thus we 
have found a versal deformation. The details of proving this, however, are 
left to the exercises. 




FIGURE 20.7.20. 

The remaining cases are more difficult and, ultimately, we will argue that 
versal deformations may not exist in some circumstances. 
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Before leaving this section we want to make some final remarks. 

Remark 1. This analysis reemphasizes the power of the method of nor- 
mal forms. As we will see throughout the remainder of this book, vector 
fields having phase spaces of dimension three or more can exhibit very 
complicated dynamics. In our case the method of normal forms utilized 
the structure of the vector field to naturally “separate” the variables. This 
enabled us to “get our foot in the door” by using powerful phase plane 
techniques. 

Remark 2. From the double-zero eigenvalue and now this case, a lesson 
to be learned is that Poincare-Andronov-Hopf bifurcations always cause 
us trouble in the sense of how they relate to global bifurcations and/or 
how they are affected by the consideration of the higher order terms of the 
normal form. 



20.7a Additional References and Applications for 
THE Hope-Steady State Bifurcation 

The Hopf-steady state bifurcation arises in a variety of applications and is 
still a current topic of research. Recent references are Arnold et al. [1988], 
Dawes [2000], Moore and Weiss [2000], Algaba et al. [1999a,b,d], Campbell 
[1999], Wu and Kupper [1998], Murphy and Lee [1998], Zimmermann et al. 
[1997], Allen and Moroz [1997], Wu and Kupper [1996], Solari and Oppo 
[1994], and Summers and Savage [1992]. 

20.7b Exercises 

1 . Consider a three-dimensional autonomous (r as large as necessary) vector field 
having a fixed point where the linear part, in Cartesian coordinates, takes the form 




The versal deformation of this nonhyperbolic fixed point was studied in some detail 
earlier. Suppose now we assume that the vector field is equivariant under the coordinate 
transformation 

(x,y,z) !->■ (x,y,-z). 

a) Show that the normal form in cylindrical coordinates is given by 

r = r{air^ + 022^) 
z = z{bir^ + b2Z^) 

9 = ui A . 

b) Show that a candidate for a versal deformation is given by 

r = r{fj,i + air^ -|- a2Z^), 
z = z(fj.2 -I- 6 ir^ -b b2Z^), 

9 = uj A . 
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c) Following the steps in the analysis of the nonsymmetric case, analyze this versal 
deformation completely, addressing all issues discussed for the nonsymmetric 
case. 

For an excellent review and bibliography of this nonhyperbolic fixed point with various 
symmetries see Langford [1985]. 

2. Consider a four-dimensional autonomous (r as large as necessary) vector field 
having a nonhyperbolic fixed point at which the linear part has the form 



/ 0 


-LJi 


0 




/w\ 


UJi 


0 


0 


0 1 


' X 1 


0 


0 


0 




1 V 1 


V 0 


0 


CJ2 


0 / 





a) Suppose mcui nu >2 ^ 0, \m\ |n| < 4. Then show that in polar coordinates a 

normal form is given by 

ri = airj -1- a2?’i’"2 + ‘ ‘ ‘ ^ 

V2 — b\r\r2 + h2v\ + • • • , 

i9i — ioi ■ , 

O 2 — (^2 ' ■ 

b) Show that a candidate for a versal deformation is given by 

fi = fiin - 1 - air^ + a 2 riT 2 , 

+ bir^r2 + b^r^, 

01 — o;i -1- - - - , 

0 2 — ‘ ‘ ‘ ■ 

c) Analyze this versal deformation completely and address all issues raised in this 
section. In particular, under what conditions may “three-tori” arise? 

d) For each of the resonant cases 

muJi nu )2 — 0, |m| -1- |n| < 4, 

discuss the codimension of the bifurcation and candidates for versal deforma- 
tions. 



3. Consider the following ordinary differential equation 



X = —^{y - 2 ) + 
v3 

LO 2 

y = - x) + [e - Ii{y - xz)]y, 



{x,y,z) G ! 



V3 



{x - y) + [e - y.{z - xy)]z, 



(20.7.30) 



where e > 0, /j- > 0, and cj are parameters. This system is useful for modeling and 
simulating synchronous machine systems in the study of power system dynamics; see 
Kaplan and Yardeni [1989] and Kaplan and Kottick [1983], [1985], [1987]. 

It should be obvious that 

(x,y,z) = (0,0,0) 

is a fixed point of (20.7.30) for all parameter values. We are interested in studying the 
bifurcations associated with this fixed point. 



a) Show that for e = 0 the eigenvalues of the matrix associated with the linearized 
vector field are given by 

0, ifcu. 
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b) Study the bifurcations associated with this fixed point for e = 0, u; 7 ^ 0, and 

e = 0, cj = 0. 



4. Consider the following class of feedback control systems studied by Holmes [1985]. 



X -\- 5x g{x) — — 2 , 
z + ocz — 0 : 7 ( 3 : — r), 



(20.7.31) 



where x and x represent the displacement and velocity, respectively, of an oscillatory 
system with nonlinear stiffness g{x) and linear damping 5x subject to negative feed- 
back control z. The controller has first-order dynamics with time constant ^ and gain 
7 . A constant or time-varying bias r can be applied. This system provides the sim- 
plest possible model for a nonlinear elastic system whose position is controlled by a 
servomechanism with negligible inertia; see Holmes and Moon [1983] for details. 

For this exercise we will assume 



g{x) = x{x^ - 1 ) 



and 



r = 0 . 



Rewriting (20.7.31) as a system gives 

X ^ y, 

y = x-x^-5y-z, (x,y,z)^S? (20.7.32) 

z — ol'^x — a 2 , 

with scalar parameters <5, o:, 7 > 0. This exercise is concerned with studying local 
bifurcations of (20.7.32). 

a) Show that (20.7.32) has fixed points at 



{x,y,z) = ( 0 , 0 , 0 ) = 0 



and 

(x, y, z) = (±71 - 7 , 0 , ± 7 V ^1 - 7 ) = P^, (7 < 1). 

b) Linearize about these three fixed points and show that (20.7.32) has the following 
bifurcation surfaces in (a, (5, 7 ) space. 



7^1 

7 = ^(a^ - 1 - aS — 1), 
7 > 1 

7 = + 

0 < 7 < 1 



one eigenvalue is zero for 0 

a pair of eigenvalues 
is pure imaginary for 0 

a pair of eigenvalues 
is pure imaginary for . 



c) Show that these three surfaces meet on the curve 



1 

7=1, S — — , 
oc 



where there is a double-zero eigenvalue with the third eigenvalue being -( 1 - 1 - 

a'^)/a. 

d) Fix a > 0 and study the bifurcations from the double-zero eigenvalue in the 
(<5, 7 ) plane. 

e) Describe all attractors as a function of 5 and 7 . Discuss the implications for the 
control problem. 

We remark that, although this exercise is concerned with local nonlinear analysis, 
global techniques for studying problems of the form (20.7.31) have been developed in 
Wiggins and Holmes [1987a], [1987b] and Wiggins [1988]. 
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5. Consider the following partial differential equation known as the complex Ginzburg- 
Landau (CGL) equation 



iAt + ^ 0A- 7|A|^A, (20.7.33) 

where (x,f) G X A{x,t) is complex and d = ocr + ioci, ^ + i^i, and 

7 = 7i? + 27J are complex numbers. 

If we set aj = 0, 7 / — 0, and /3 — 0, (20.7.33) reduces to 



iAt + anAtix = —1r\A\^ A, (20.7.34) 

which is a famous completely integrable partial differential equation known as the 
nonlinear Schrodinger (NLS) equation. We refer the reader to Newell [1985] for back- 
ground material and a discussion of the physical circumstances in which (20.7.33) and 
(20.7.34) arise. We will comment on this in more detail at the end of this exercise. 

a) Show that (20.7.33) is invariant under translations in space and time, i.e., under 
the transformation 

{x, f) 1-^ (x -I- Xq, t to). 

Show also that (20.7.33) is invariant under multiplication by a complex number 
of unit modulus, i.e., under the transformation 

A Ae^'^'o. 

Our goal in this exercise will be to study solutions of (20.7.33) that have the form 

A(x, t) = a{x)e'“*. (20.7.35) 

b) Substitute (20.7.35) into (20.7.33) and show that a{x) satisfies the following 
complex Duffing equation 

a" — (a -|- iffia -|- (7 -|- i6)\a\‘^ a — 0 , 

where 

a ^ [aR{uj + /3_r + aj/3/j/A, 

(3 = [aR0i — a/(cj + /3 r)]/A, 

7 = [oi? 7 fl + q;/ 7 /]/A, 

S = [ai?7j - aj7ii]/A, 

and 

A 2,2 

A = a^+a,. 

c) Letting a — b ic, show that (20.7.36) can be written as 

b' — d, 

d' = Oib — f3c — {-yb — Sc)(b^ c^), 
c — e, 

e' — f3b etc — {Sb -\- 7 c)(b^ - 1 - c^). 



(20.7.36) 



(20.7.37) 



d) Show that, for l3 — 5 — 0, (20.7.37) is a completely integrable Hamiltonian 
system with integrals 



d e <^.2.,2 x |7.2. ,2-.2 
H = -(c + & )+ -(c +h ) , 

m — be — cd. 
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e) Using the transformation 



iip 

a — pe , 



show that (20.7.36) can be written in the form 



p" - p{cp'f = ap- 7P^, 

(p'^p'Y = (0-Sp^)p\ 



f) Let 
and 



V = p'/p. 
2 / 

m — p (p , 



and show that (20.7.38) can be written in the form 



r — 2rv, 

/ 2 
V — — V + q ; — -yr, 

J.2 

m = (/3 — 5r)r. 



(20.7.38) 



(20.7.39) 



g) For (3 — 5 — Q, show that (20.7.39) has the form of a one-parameter family (with 
m playing the role of the parameter) of two-dimensional Hamiltonian systems 
with Hamiltonian function 



H{r, v\ m) — rv^ H ar -|- 



7 2 



(20.7.40) 



h) Using (20.7.40), give a complete description of the orbit structure of (20.7.39) 
for /3 — 5 — 0. 

i) Consider the symmetries of the CGL equation described in a). Discuss how these 
symmetries are manifested in (20.7.36), (20.7.37), (20.7.38), and (20.7.39). 

j) For (3'y — a<5, the point 

(r,u,m) = ( — = — ,0,0 
V7 <5 

is a fixed point of (20.7.39) where the eigenvalues of the matrix associated with 
the linearization are given by 

0, ±i>/2a. 

Study the bifurcations associated with this fixed point (take a > 0). 

k) Discuss the implications of the results obtained concerning the dynamics of the 
ordinary differential equations for the spatial and temporal structure of solutions 
to the CGL equation. 

1) In our original discussion of the GGL and NLS equations, we did not mention 
initial or boundary conditions. Discuss this issue in the context of the solutions 
we found. 



The CGL equation is a fundamental equation that arises in a variety of physical situa- 
tions. See Newell [1985], where it is derived in the context of nonlinear waves, and Land- 
man [1987], where it is used to understand the transition to turbulence in Poiseuille 
flow. Most of this exercise is based on results in Holmes [1986]; see also Holmes and 
Wood [1985] and Newton and Sirovich [1986a, b]. 
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20.8 Versal Deformations of Linear Hamiltonian 
Systems 



In this section we describe versal deformation theory for linear Hamiltonian 
systems that is very similar to the theory developed for matrices in Section 
20.5. 

The versal deformation theory for matrices used the Jordan canonical 
form in a variety of ways. Similarly, we need something like a Jordan 
canonical form theory for the matrices associated with linear Hamiltonian 
systems, or infinitesimally symplectic matrices. First, we establish some no- 
tation. We will be concerned with linear Hamiltonian vector fields. Hence, 
the Hamiltonians are quadratic forms of the following form: 

Ho{x) = ]^{Aox,x), ( 20 . 8 . 1 ) 



where 

x= (gi,...,g„,pi,...,p„), 

and Aq is a 2n X 2n real symmetric matrix. The associated Hamiltonian 
vector field is then given by 



X = JAqx, 



where 



J = 



0 id 
-id 0 



and “id” denotes the nx n identity matrix. By the term eigenvalues of the 
Hamiltonian we will mean the eigenvalues of the infinitesimally symplectic 
matrix JHq, and by the term Jordan block of the Hamiltonian we will mean 
Jordan block of the infinitesimally symplectic matrix J Aq. 



20.8a Williamson’s Theorem 

According to Proposition 14.3.5 from Chapter 14, the eigenvalues of the 
infinitesimally symplectic matrix J Aq occur in four possible ways: 



1. real pairs: {a,— a), 

2. purely imaginary pairs: (za, —ia) 
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3. complex quartets: (±a±f6), 

4. zero eigenvalues, with even multiplicity. 



This eigenvalue structure is reflected in the Jordan block structure. For 
example, if there is a Jordan block of dimension k corresponding to the real 
eigenvalue a, then there is a Jordan block of dimension k corresponding to 
the real eigenvalue —a. Similarly, if there is a Jordan block of dimension 
k corresponding to the complex eigenvalue a + ib, then there are three 
other Jordan blocks of dimension k corresponding to the remaining three 
eigenvalues in the quartet. In the case of purely imaginary eigenvalues we 
have to distinguish between Jordan blocks of even and odd dimension. For 
zero eigenvalues there can also be Jordan blocks of even and odd dimension, 
but if they are odd they occur in pairs. 

Williamson [1936] showed that the following is a complete list of normal 
forms corresponding to the different possible Jordan blocks. 



Pair of Jordan Blocks of Dimension k Corresponding to Real Eigenvalues 
±a 



k k — 1 

Ho = -a'^piQ^ + y^piqi+i. 

2 = 1 2 = 1 



( 20 . 8 . 2 ) 



Quartet of Jordan Blocks of Dimension k with Eigenvalues zba zb ib 

2k k 2k-2 

Ho = -a'^p^qi + h'^{p 2 ^-lq 2 i-P 2 iq 2 i-l)+ Piq^+ 2 ■ (20.8.3) 

Pair of Jordan Blocks with Odd Dimension k Corresponding to 
Eigenvalue 0 



fe-i 

H = Y,P^q^+l■ (20.8.4) 

2=1 



For A: = 1, i^o = 0- 
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Pair of Jordan Blocks with Even Dimension k Corresponding to 
Eigenvalue 0 



Ho = 




^ q^ql+l-^ 



l-l 

i-1 



where I = For k = 2, Hq = ±|<Zi- 



(20.8.5) 



Pair of Jordan Blocks with Odd Dimension k Corresponding to 
Eigenvalues Eia 



Ho — Tx ( {a^P2i-lP2l+l-2i + q2i-iq2l+l-2i) 



\i=l 



l-l 



21-2 



- ^ {a?P2iP2i-2i + q2iq2i-2i) - ^ Piqi+1, (20.8.6) 



i=l 



i=l 



where I = For k = 1, Hq = {afpi + qi)- 



Pair of Jordan Blocks with Even Dimension k Corresponding to 
Eigenvalues Eia 



1 

^o = T2 E P2i+lP2l + l-2i + P2i+2P2l+2-2i) 

^ \i=l 

^ / 1 \\ 

— 2_^ I -^q2i-iq2l+l-2i + q2iq2l+2-2i j j 

I I 

— 0,^ E/ 1 q2i + E/ <72i- 1 , 



(20.8.7) 



where / = |. For /c = 2, Hq = {^qj + ql) - o^piq2 + P2qi- 
We can now state Williamson’s theorem. 



Theorem 20.8.1 (Williamson) A real symplectic vector space with a 
given quadratic form Hq can he decomposed into a direct sum of skew or- 
thogonal real symplectic subspaces in such a way that the quadratic form 
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Hq is represented as a sum of quadratic forms of the types listed above on 
these subspaces. 

Williamson’s list of normal forms is not unique. Slightly different lists are 
given by Bryuno [1988], and Laub and Meyer [1974]. Bryuno [1988] gives 
an excellent history of the subject. Other relevant work is Burgoyne and 
Cushman [1977a], [1977b]. A version of Williamson’s theorem that applies 
to Hamiltonians invariant with respect to a compact Lie group can be 
found in Melbourne and Dellnitz [1993]. Churchill and Kummer [1999] give 
an algorithm for computing Williamson’s normal forms. Their paper also 
surveys and discusses a variety of issues associated with the computation 
of normal forms for Hamiltonian systems. 



20.8b Verbal Deformations of Jordan Blocks 
Corresponding to Repeated Eigenvalues 

We next describe versal deformations of the Jordan blocks corresponding to 
repeated eigenvalues described above classihed according to codimension. 
These results are due to Galin [1982] (see also Kocak [1984] and Hoveijn 
[1996]). The theory is very similar to the theory developed in our discussion 
of versal deformations of matrices in Section 20.5. We begin by describing 
the modihcations required for the dehnitions from that section. 

We denote the set of all 2n x 2n inhnitesimally symplectic matrices (with 
real entries) by M = sp(2n,M). M is a n(2n + 1) dimensional manifold. 
Matrices in M can be represented by 

JA, 



where 



J = 



0 id 
-id 0 



and A is a 2n X 2n symmetric matrix. We also consider the Lie group 
G = Sp{2n, K) of 2n x 2n symplectic matrices with real entries. 



Definition 20.8.2 (Adjoint Action) The group G acts on M according 
to the formula 

AdsJA = SJAS~^, {JAgM, SgG). 



(Ad stands for adjoint.) 
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Definition 20.8.3 (Orbit Under the Adjoint Action) Consider the 
orbit of an arbitrary fixed matrix JAq G M under the adjoint action of G 
on M; this is the set of points J A G M such that J A = S JAq S~^ for all 
SeG. 



The orbit of JAq under G forms a smooth submanifold of M, which we 
denote by N. Hence, the orbit, N, of JAq consists of all matrices similar 
to JAq. 



Definition 20.8.4 (Deformation of an Infinitesimally Symplectic 
Matric) A deformation of J Aq is a G'~ (r > 1) mapping 

JA: A ^ M, 

X^JA{\), 



where A G zs some parameter space and 



JA{Xq) — JAq. 



A deformation is also called a family, the variables Xi, i = 1, •••,£, are 
called the parameters, and A is called the base of the family. 



Definition 20.8.5 (Equivalence of Deformations) Two deformations 
JA{X), JB{X) of JAq are called equivalent if there exists a deformation of 
the identity matrix U(A) (C{Xq) = id), where C(A) is symplectic for each 
value of X, with the same base, such that 

JB{X) = G{X)JA{X)C~\X). 



Definition 20.8.6 (Induced Family) Let S C K™, A C &e open sets. 
Consider the C^ (r >1) mapping 

(/) : S — >■ A, 

(fill), 



with 4 >{pq) = Aq. 



The family induced from JA by the mapping (j) is called {(jf J A){p) and 
is defined by 

{(f*JA){p) = JA{cf{p)), p G K™. 
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Definition 20.8.7 (Versal, Universal, and Miniversal Deforma 
tion) A deformation JA{\) of an infinitesimally symplectic matrix JAq 
is said to he versal if any deformation JB{yi) of JAq is equivalent to a 
deformation induced from JA, i.e., 

JB{yi) = C{qi)JA{cf{yi))C-\qi) 



for some change of parameters 

(/) : S — >■ A, 



with C{plq) = id and = Aq. 

A versal deformation is said to be universal if the inducing mapping (i.e., 
change of parameters map) is determined uniquely by the deformation JB. 

A versal deformation is said to he miniversal if the dimension of the 
parameter space is the smallest possible for a versal deformation. 



Versal deformations for infinitesimally symplectic matrices are con- 
structed in essentially the same way as for arbitrary matrices in Section 
20.5. The centralizer of a matrix played a key role. 



Definition 20.8.8 (Centralizer) The centralizer of an infinitesimally 
symplectic matrix JA is the set Z of all infinitesimally symplectic matrices 
JC such that 

JCJA = JAJC. 



The dimension of the centralizer can be computed by the formula given 
in the following lemma. 



Lemma 20.8.9 The dimension of the centralizer of an infinitesimally 
symplectic matrix JAq depends on the Jordan form of the matrix, and is 
given by the formula 



dim Z = - 



z/O 



s(z) 

^(2j- l)rij{z) - 1 









j=i 



V U V 

+ ^[2(2j-l)m, + l]+2^^ min{mj, rhk} , 

j=l j=l k=l 



where ni{z) > ri2(z) > ■ ■ ■ > ris(z) are the dimensions of the Jordan blocks 
with eigenvalue z ^ Q, mi > m 2 > ■ ■ ■ > mi > m2 > • • • > rhv are 
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the dimensions of the Jordan blocks with eigenvalue z = 0, the numbers 
mj being even, while rhj are odd ( only one block out of each pair of Jordan 
blocks of odd dimension is taken into account). 

The relationship between the dimension of the centralizer and the orbit 
of an infinitesimally symplectic matrix under the adjoint action of G is 
given in the following proposition. 



Proposition 20.8.10 The dimension of the centralizer of an arbitrary in- 
finitesimally symplectic matrix JAq is equal to the codimension of its orbit 
in the space M of infinitesimally symplectic matrices. 



The following proposition is analogous to Proposition 20.5.8, and proven 
in the same way. 



Proposition 20.8.11 A deformation JA(A) of an arbitrary infinitesi- 
mally symplectic matrix JAq is versal if and only if the mapping JA{\) 
is transversal to N at the point A = Aq. 



Versal deformations for infinitesimally symplectic matrices are computed 
in exactly the same way as for arbitrary matrices as described in Section 
20.5. For a Jordan block JAIq, a versal deformation is of the form 

J Aq + J B, 



where B is symmetric, and has entries consisting of the appropriate number 
of parameters (as given by Lemma 20.8.9), with the parameters occurring 
in such a way that JB is not in the tangent space of N. Lemma 20.5.11 
and the inner product on the space of matrices (equation (20.5.11)) can 
be used to verify that JB has this property. Once this has been done, we 
can also recover Aq-\- B (by multiplying by J~^ , and write a corresponding 
versal deformation of the Hamiltonian, or quadratic form (20.8.1). Galin 
[1982] has made a list of these quadratic forms for codimension < 2, which 
we now reproduce. 



20.8c Versal Deformations of Quadratic 
Hamiltonians of Codimension < 2 



First, we introduce some notation. For a given eigenvalue A, we will de- 
note the corresponding Jordan block of dimension (or order) k by (A)*. 
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The following will be our shorthand notation denoting the Jordan block 
structure associated with the pairs or quartets of eigenvalues that occur 
for infinitesimally symplectic matrices (a and b stand for real numbers): 

(+a)'=(-a)'= = (±a)^ 

(+m)^(— ia)^ = (±m)^, 

(+a + z6)^(+a — ih)^{—a + ib)'^{—a — ib)^ = (±a ± ib)'^. 



Codimension Zero 

This is the situation when there are no multiple eigenvalues. 



Codimension One 

There are three cases of codimension one Jordan blocks. 



H{\) = -a{piqi +P2q2) + Piq2 + AiP2<7i- (20.8.8) 

(±ta)^ 

H{\) = p2qi - a‘^piq2 ± ^ + q^j + yP?- (20.8.9) 

0^ 

HW = + YPi (20.8.10) 



Codimension Two 



There are nine cases of codimension two Jordan blocks. 



(±a? 

H(X) = -a(piqi +P 2 q 2 + pzqz) + (piq 2 + P 2 qs) 

+ AiP29i + A2P3<7i (20.8.11) 

(±ta)3 

H(X) = - (Piq2+P2q3) T ^ {2a^PiP3 ~ a^pl + 2qiqs - ql) 

\ A2 2 

+ A1P291 + y 9 i 



( 20 . 8 . 12 ) 
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(±a ± ib)"^ 

H{\) = -a{piqi +P2Q2 + Pzqs + Piqi) 

+ b {piq2 - P2 Qi + P3Q4 - P4Q3) 

+ (pi93 + P2<?4) + AiP3(?i + A2P4gi (20.8.13) 

H{X) = -Piq2 ±]^{p\- 2 qiq 2 ) + \1P1P2 + yP 2 (20.8.14) 
{±a?{±hY 

H{\) = -a {piqi + P2q2) + Pi<?2 - b {pzq 3 + P 4 <? 4 ) 

+ P3?4 + AiP2<?1 + A2P493 (20.8.15) 



(±m)^ {±iby 



H{X) = p2qi - aV<?2 ± 7 ; ( + 92 ) +P493 



1 / 1 



Ai 



A^ 



- 6^P394 ± a f + qi] + '-^p{ + '-^p\ (20.8.16) 



(±a)2 {±ibf 

H{X) = -a{piqi + P292 ) +Pi92 + P4<?3 

- 6^P394 ± ^ + 94^ + A 1 P 291 + yP3 (20.8.17) 

(±a)2 02 

H{X) = -a{piqi +P 292 ) +P 192 T ^9s + A 1 P 291 + yPs (20.8.18) 
(±fa)^ 0^ 

H{X) = p2qi-a^Piq2±^ (^^9? + 9^^ T^93 + yPi + yP3 (20.8.19) 



20. 8d Versal Deformations of Linear, Reversible 
Dynamical Systems 

Versal deformations for linear, reversible systems have been worked out by 
Sevryuk [1986], [1992], and Hoveijn [1996]. 
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20. 8e Exercises 

1. Compute the form of the quadratic Hamiltonians in the codimension zero cases. Also 
compute the associated infinitesimally symplectic matrices. 

2. For the codimension one versal deformations, compute the associated infinitesimally 
symplectic matrices. 

3. Prove that the three codimension one cases in Galin’s list are indeed versal deforma- 
tions. 

4. Eigenvalue Movement. For the codimension one versal deformations, sketch the po- 
sitions of the eigenvalues in the complex plane for Ai < 0, Ai = 0, and Ai > 0. Indicate 
how the eigenvalues move as Ai is varied from positive to negative. The case (±m)^ 
is known as the Hamiltonian Hopf bifurcation. However, as pointed out in Meyer and 
Hall [1992], Hopf had nothing to do with the study of the bifurcations associated with 
this case, van der Meer [1985] and Meyer and Hall [1992] give discussions that put this 
in the correct historical context. An elementary exposition of the Hamiltonian Hopf 
bifurcation is given by Lahiri and Roy [2001]. 

5. Compute the normal form (the leading order terms beyond quadratic) for the Hamil- 
tonian Hopf bifurcation. 

6. Compute the normal form (the leading order terms beyond quadratic) for the the case 

i±ar. 



20.9 Elementary Hamiltonian Bifurcations 



In this section we describe some of the basic elementary Hamiltonian bifur- 
cations. Hamiltonian bifurcation theory is a rapidly developing area. The 
articles of Meyer [1975], [1986], and Golubitsky et al. [1995] and the books 
of Arnold et al. [1988] and Meyer and Hall [1992] provide a good overview of 
the subject of Hamiltonian bifurcations of equilibrium points and periodic 
orbits, with and without symmetry. 



20.9a One Degree-of-Freedom Systems 

We now describe three elementary bifurcations of equilibrium points in 
one-degree-of-freedom Hamiltonian systems in the spirit of the saddle- 
node, pitchfork, and Hopf bifurcations for general vector fields that we 
described earlier. The following discussion is taken from Golubitsky and 
Stewart [1987]. 



Hamiltonian Saddle Node 

The analog of the saddle-node bifurcation for Hamiltonian systems is a sit- 
uation where as a parameter is varied two equilibria, a saddle and a center. 
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FIGURE 20.9.1. Phase portraits corresponding to the Hamiltonian saddle-node 
bifurcation. 

collide and disappear, leaving no fixed points. Since the system is Hamil- 
tonian it is (at least) two dimensional, and one would expect the center to 
be surrounded by periodic orbits and the saddle to have a separatrix (or 
homoclinic orbit). The normal form for this bifurcation is given by 

H{p,q, X) = Xp + , (p, g. A) S x x (20.9.1) 

The phase portraits are shown in Figure 20.9.1. 



Hamiltonian Pitchfork: Z-i Symmetry 



The standard pitchfork bifurcation of equilibria is generic in one parameter 
families of vector fields equivariant with respect to a Z 2 action. Similarly, 
the Hamiltonian analog of the pitchfork occurs generically in one parameter 
families of Hamiltonians that are invariant with respect to a Z 2 action. The 
normal form is given by 
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H{p,q,\) = , (p, g, A) G X X (20.9.2) 



The phase portraits are shown in Figure 20.9.2. 

9 




FIGURE 20.9.2. Phase portraits corresponding to the Hamiltonian pitchfork bi- 
furcation. 



Symmetry 

The Poincare- Andronov-Hopf bifurcation is a generic bifurcation of equi- 
libria for one parameters families of two-dimensional vector fields having an 
symmetry. The normal form for a one-degree-of-freedom Hamiltonian 
in the neighborhood of an equilibrium point, and invariant with respect to 
an symmetry is given by 

H{p,q,X) = \{p'^ + q"^) + {p'^ + (p, g, A) G x x (20.9.3) 



However, the term Hamiltonian Hopf bifurcation is not applied to this 
situation. The phase portraits are shown in Figure 20.9.3. 
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A .<0 



FIGURE 20.9.3. Phase portraits corresponding to the Hamiltonian invariant with 
respect to an symmetry. The heavy circle represents a circle of equilibrium 
points. 

20.9b Exercises 

1. Verify that the phase portraits shown in the Figures 20.9.1, 20.9.2, and 20.9.3 are 
correct. 

2. For the Hamiltonian pitchfork bifurcation, an elliptic equilibrium point became a sad- 
dle (as the parameter passed through zero) with a symmetric pair of elliptic equilibria 
branching from it. Can you write down a normal form where a saddle type equilibrium 
point becomes an elliptic equilibrium point with two saddle type equilibria branching 
from it? 



3. Prove that these three normal forms (20.9.1), (20.9.2), and (20.9.3) are indeed versal 
deformations for the situations described. 
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20.9c Bifurcations Near Resonant Elliptic 
Equilibrium Points 



Bifurcations near resonant elliptic equilibrium points have been studied 
in detail by a number of authors. Arnold et al. [1988] and Meyer and Hall 
[1992] give excellent surveys. A great deal is known about bifurcations near 
resonant elliptic equilibria in the two degree-of- freedom case. There are 
two natural bifurcation parameters. One is the energy and the other is the 
“detuning”, which allows one to study the passage through the resonance. 
The main reason so much is known in the two degree-of-freedom case is 
that the truncated normal form is integrable. Very little is known about 
bifurcations in resonant elliptic equilibria in n degree- of-freedom systems, 
n > 3, when the multiplicity of the resonance is larger than two (the 
truncated normal forms associated with multiplicity one resonances are 
still integrable, regardless of the number of degrees of freedom, provided it 
is finite). Hoveijn [1992] and Haller and Wiggins [1996] give recent surveys 
of the literature related to resonant elliptic equilibria with three or more 
degrees-of- freedom . 

We now describe how one analyzes bifurcations near resonant equilibria 
in two degree-of-freedom Hamiltonian systems. 



First we recall some basic results from Section 19.10b. The quadratic 
part of the Hamiltonian is given by 



H{zi,Zi,Z2,Z2) 




0J2 

T 



U 2 I 



and is said to be in resonance provided ^ is a rational number. The trun- 
cated normal forms for the 1 : 1 (semisimple), 1 : 2, and 1 : 3 resonances 
were computed and found to be: 



1:1 Resonance 



H{zi, Zi, Z 2 , Z 2 ) — 2 (1 + + 2 + '^ 2 )| 2 : 2 p 

+ Ciiii|zi|^|2:2|^ -I- Co 022|2:2|^ + C220okl|‘* 

+ 2|c2oii |R eziZiZ2 + 2|ciio2|He2;iZ2z| -I- 2|c2oo2|Hez^z|, 



1:2 Resonance 



H{zi,Zi,Z2,Z2) 



-(1 -|- (5 i )|2: i |^ -|- (1 -|- <52)1221^ + 2 \ c 200 l \^^ zfz 2 , 
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1:3 Resonance 



H{zi, Zi, Z 2 , Z 2 ) — -(1 + 5i)|zip + -(1 + S 2 )\z 2 \'^ 

+ C0202|^2|^ + C202okl|"^ + Cllllkin22p 
+ 2|c300l|ReZiZ2- 



These expressions differ slightly from the expressions computed in Section 
19.10b in that we have included the (small) detuning parameters and 62 
which allow for the variation of the resonant frequencies. 

Following Arnold et al. [1988], each of these normal forms can be trans- 
formed into a parametrized family of one degree-of-freedom Hamiltonian 
systems plus an “action-angle pair” . The interesting dynamical information 
is contained in the family of one degree-of-freedom systems. In this way the 
entire phase space stucture can be uncovered. Moreover, the one degree- 
of-freedom bifurcations described in the previous section are particularly 
relevant for the analysis of the family of one degree-of-freedom systems. 

The transformation is carried out as follows. First one transforms the 
normal form to symplectic polar coordinates via the transformation 

Zk = , Zk = , fc = 1, 2. 



If the resonance relation between wi and UJ 2 is expressed in the form 

kiLUi + k 2 ljJ 2 = 0, 



where ki and k 2 are relatively prime (and not both zero), then we choose 
relatively prime integers h and I 2 satisfying 

k\l2 — ^2^1 = 1- 

Then the following symplectic transformation casts the truncated normal 
form into the desired form 



V?i = h-ip - k2X, 
(P2 = -/iV' + kiX, 
Pi = k\G III, 
P2 — feG + I2I ■ 



(20.9.4) 
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20. 9d Exercises 

1. Prove that (20.9.4) is a symplectic transformation. 

2. Recast the truncated normal forms for the 1 : 1 (semisimple), 1 : 2, and 1 : 3 resonances 
into the I — G — 'll) — x coordinates defined in (20.9.4). In these coordinates show that 
(5i and <52 can be effectively combined as one parameter, and give an interpretation of 
this parameter in terms of the resonance frequencies. Analyze the bifurcations (as a 
function of “energy” and “detuning”), and describe the phase space structure for the 
1 : 1 (semisimple), 1 : 2, and 1 : 3 resonances. 

3. Show that for a general resonance, i.e., 

+ fc2o;2 — 0, 

where k\ and ^2 are relatively prime (and not both zero), the truncated normal form 
has the general form 

1^1 I 1^21 

Hkj^,k2 = ^iPl + <^2P2 + F{pi, P2) + P2 ^ cos (fciipi + fe2<P2 + V'o) • 

See Arnold et al. [1988] for an analysis of this normal form. 

4. Using the coordinate transformations given above, analyze the truncated normal form 
associated with the Hamiltonian Hopf bifurcation (i.e., the case (±m)^) derived in the 
previous section. 

5. Using the coordinate transformations given above, analyze the truncated normal form 
associated with the case (±a)^ derived in the previous section. 

6. Survey the literature and find five applications where bifurcations associated with the 
case (±ia)^ arise. 

7. Survey the literature and find five applications where bifurcations associated with the 
case (±a)^ arise. 
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Bifurcations of Fixed Points of 
Maps 



The theory for bifurcations of fixed points of maps is very similar to the 
theory for vector fields. Therefore, we will not include as much detail but 
merely highlight the differences when they occur. 

Consider a p-parameter family of maps of K" into K” 

y^g{y,\), p G K", A G (21.0.1) 

where g is O'" (with r to be specified later, usually r > 5 is sufficient) on 
some sufficiently large open set in K" x Suppose (21.0.1) has a fixed 
point at (y. A) = (yo,Ao), i.e.. 



5(j/o, Ao) = yo- 



( 21 . 0 . 2 ) 



Then, just as in the case for vector fields, two questions naturally arise. 



1. Is the fixed point stable or unstable? 

2. How is the stability or instability affected as A is varied? 



As in the case for vector fields, an examination of the associated linearized 
map is the first place to start in order to answer these questions. The 
associated linearized map is given by 

C^i^j,y(yo,Ao)?, ^ G K”, (21.0.3) 

and, from Chapter 1, we know that if the fixed point is hyperbolic (i.e., none 
of the eigenvalues of I?j,y(yo, Aq) have unit modulus), then stability (resp. 
instability) in the linear approximation implies stability (resp. instability) 
of the fixed point of the nonlinear map. Moreover, using an implicit function 
theorem argument exactly like that given at the beginning of Section 20, it 
can be shown that, in a sufficiently small neighborhood of (yo, Aq), for each 
A there is a unique fixed point having the same stability type as (yo,Ao). 
Thus, hyperbolic fixed points are locally dynamically dull! 
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The fun begins when we consider Questions 1 and 2 above in the situation 
when the fixed point is not hyperbolic. Just as in the case for vector fields, 
the linear approximation cannot be used to determine stability, and varying 
A can result in the creation of new orbits (i.e., bifurcation). The simplest 
ways in which a fixed point of a map can be nonhyperbolic are the following. 

1. Dyg{yo, Ao) has a single eigenvalue equal to 1 with the remaining n— 1 
eigenvalues having moduli not equal to 1. 

2. Dyg{yo, Xq) has a single eigenvalue equal to -1 with the remaining 
n — 1 eigenvalues having moduli not equal to 1. 

3. Dyg{yQ, Aq) has two complex conjugate eigenvalues having modulus 1 
(which are not one of the first four roots of unity) with the remaining 
n — 2 eigenvalues having moduli not equal to 1. 



Using the center manifold theory, the analysis of the above situations 
can be reduced to the analysis of a p-parameter family of one-, one-, and 
two-dimensional maps, respectively. 

Moreover, all of our results also apply immediately to periodic points of 
maps. Suppose the map y >->■ g{y, X), y G K”, A G has a period k orbit for 
A = Ao, i.e., there is a sequence of length k, denoted {yo, yi, . . . , yk-i,yk}, 
with yk = yo, such that g{yi, Aq) = g{yi+i,Xo), t = 0, . . . , fc - 1. Then yi is 
a fixed point for the kth iterate of g, i.e., g^{yi, Aq) = yi, i = 0, . . ■ , k — 1. 
In other words, a periodic k orbit for a map gives rise to k fixed points 
for the fcth iterate of the map and our results can be applied to each point 
individually for the fcth iterate. 

We begin with the case of an eigenvalue equal to one. 



21.1 An Eigenvalue of 1 



In this case, the study of the orbit structure near the fixed point can be re- 
duced to the study of a parametrized family of maps on the one-dimensional 
center manifold. We suppose that the map on the center manifold is given 

by 

x^f{x,g), X G /r G (21.1.1) 

where, for now, we will consider only one parameter (if there is more than 
one parameter in the problem, we will consider all but one as fixed con- 
stants). In making the reduction to the center manifold, the fixed point 
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(j/o,^o) G R" X has been transformed to the origin in x (cf. 
Section 18.1) so that we have 

/( 0 , 0 )= 0 , ( 21 . 1 . 2 ) 

|^(0,0) = 1. (21.1.3) 

21.1a The Saddle-Node Bifurcation 



Consider the map 

a; I— >■ /(x, /i) = a; + /i =F x G p. G K^. (21.1.4) 

It is easy to verify that = (0,0) is a nonhyperbolic fixed point of 

(21.1.4) with eigenvalue 1, i.e., 



/(0,0)=0, (21.1.5) 

|^( 0 , 0 ) = 1 . ( 21 . 1 . 6 ) 

We are interested in the nature of the fixed points for (21.1.4) near 
= (0,0). Since (21.1.4) is so simple, we can solve for the fixed points 
directly as follows 

/(x, fj,) — X = fi ^ x^ = 0. (21.1.7) 

We show the two curves of fixed points in Figure 21.1.1 and leave it as an 
exercise for the reader to verify the stability types of the different branches 
of fixed points shown in this figure. We refer to the bifurcation occuring at 
(x,/i) = (0,0) as a saddle-node bifurcation. 

In analogy with the situation for vector fields (see Section 20.1c) we want 
to find general conditions (in terms of derivatives evaluated at the bifur- 
cation point) under which a map will undergo a saddle-node bifurcation, 
i.e., 



the map possesses a unique curve of fixed points in the x — plane 
passing through the bifurcation point which locally lies on one side 
of fi — 0. 



We proceed using the implicit function theorem exactly as in the case for 
vector fields. 

Consider a general one-parameter family of one-dimensional maps 
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(b) 

FIGURE 21.1.1. a) f{x, fi) = x + fi — x'^; h) f{x, fi) = x + fi + x'^. 



xi-^f{x,fi), X G /r G 


(21.1.8) 


with 




/(0,0)=0, 


(21.1.9) 




(21.1.10) 


The fixed points of (3.2.11) are given by 




/(x,/x) -x = h{x,fi) = 0. 


(21.1.11) 



We seek conditions under which (21.1.11) defines a curve in the x — 
plane with the properties described above. By the implicit function theo- 
rem, 

|( 0 , 0 ) = |( 0 , 0)^0 ( 21 , 1 . 12 ) 

implies that a single curve of fixed points passes through (x,/r) = (0,0); 
moreover, for x sufficiently small, this curve of fixed points can be rep- 
resented as a graph over the x variables, i.e., there exists a unique C’’ 
function, /x(a;), x sufficiently small, such that 



h{x, fi{x)) = f{x, n{x)) -x = 0. 



(21.1.13) 
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Now we simply require that 

^(0) = 0, (21.1.14) 

^(0)^0. (21.1.15) 

As was the case for vector fields (Section 20.1c), we obtain (21.1.14) 
and (21.1.15) in terms of derivatives of the map at the bifurcation point 
by implicitly differentiating (3.2.16). Following (20.1.32) and (20.1.35), we 
obtain 



f(0>0) i(O.O) 

f(0,0) f(0,0) ■ 



(21.1.16) 



(21.1.17) 



To summarize, a general one-parameter family of C’’ (r > 2) one-dimen- 
sional maps 



a: G /r G 



X 





(b) 



FIGURE 21.1.2. 
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undergoes a saddle-node bifurcation at (x,/r) = (0,0) if 

/(O, 



0 ) =0 

), 0 ) = 1 



nonhyperbolic fixed point 



with 



d^j. 



(0,0) ^0, 



52/ 

dx'^ 



( 0 , 0 )^ 0 . 



(21.1.18) 

(21.1.19) 

( 21 . 1 . 20 ) 



Moreover, the sign of (21.1.17) tells us on which side of ^ = 0 the curve 
of fixed points is located; we show the two cases in Figure 21.1.2 and leave 
it as an exercise for the reader to compute the possible stability types of 
the branches of fixed points shown in the figure. Thus, (21.1.4) can be 
viewed as a normal form for the saddle-node bifurcation of maps. Notice 
that, with the exception of the condition §y(0,0) = 1, the conditions for 
a one-parameter family of one-dimensional maps to undergo a saddle-node 
bifurcation in terms of derivatives of the map at the bifurcation point are 
exactly the same as those for vector fields (cf. (20.1.38), (20.1.39) and 
(20.1.40)). The reader should consider the implications of this. 

Before finishing our discussion of the saddle-node bifurcation we want 
to describe a way of geometrically visualizing the bifurcation which will be 
useful later on. In the x — y plane, the graph of f{x,y) (thinking of /i as 
fixed) is given by 

graph f{x,fj,) = {(x,y) G y = f{x,fi)}. 

The graph of the function g(x) = x is given by 

graph g{x) = { (x, y) G | y = x}. 

The intersection of these two graphs is given by 

{(x,y) G y = X = /(x,y)}. 



i.e., this is simply the set of fixed points for the map x i-G- /(x, y) . The latter 
is simply a more mathematically complete way of saying that we draw the 
curve y = /(x, y) (y fixed) and the line y = x in the x — y plane and look 
for their intersections. We illustrate this for the map 

X !—>■ X -I- /r — x^ 

in Figure 21.1.3 for different values of g. that graphically demonstrate the 
saddle-node bifurcation. 
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FIGURE 21.1.3. 

21.1b The Transcritic al Bifurcation 

Consider the maps 

X 1-^ f{x, fx) = X + fxx ^ a; € ^ G (21.1.21) 

It is easy to verify that (x,fi) = (0,0) is a nonhyperbolic fixed point of 
(21.1.21) with eigenvalue 1, i.e., 

/( 0 , 0 )= 0 , ( 21 . 1 . 22 ) 

df 

^(0,0) = 1. (21.1.23) 

The simplicity of (21.1.21) allows us to calculate all the fixed points 
relatively easily. They are given by 

f{x, ^) — X = iix ^ x^ = 0. (21.1.24) 

Hence, there are two curves of fixed points passing through the bifurcation 
point, 

x = 0 (21.1.25) 

and 

^i = ±x. (21.1.26) 

We illustrate the two cases in Figure 21.1.4 and leave it as an exercise 
for the reader to compute the stability types of the different curves of 
fixed points shown in this figure. We refer to this type of bifurcation as a 
transcritical bifurcation. 

We now want to find conditions for a general one-parameter family of C’’ 
(r > 2) one-dimensional maps to undergo a transcritical bifurcation, i.e., 
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X 





FIGURE 21.1.4. a) f{x, fi) = x + fj,x — x'^ ; b) f{x, fi) = x + fxx + x'^. 

in the x — ^ plane the map has two curves of fixed points passing 
through the origin and existing on both sides of jx — 0. 



Consider a C’’ (r > 2) map 

a: G /t G 



with 



/( 0 . 0 ) = 0 1 
K0).o) = i/ 



nonhyperbolic fixed point. 



The fixed points of (21.1.27) are given by 



f{x,fj.) -x = h{x,fi) = 0. 



(21.1.27) 



(21.1.28) 



(21.1.29) 



Henceforth the argument is very similar to that for the transcritical bi- 
furcation of one-parameter families of one-dimensional vector fields; see 
Section 20. Id. We want two curves of fixed points to pass through the 
bifurcation point (a;,/t) = (0,0), so we require that 



dh 

9/i 



(0,0) 



dl 



( 0 , 0 ) = 0 . 



Next, we want one of these curves of fixed points to be given by 



(21.1.30) 
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a; = 0; 

we thus take (21.1.29) of the form 

= xH{x,^) = x(F(x,/i) — 1), 

where 






X ^ 0 



= i I 

dx X — 0 



and, hence, 






X ^ 0 






(21.1.31) 

(21.1.32) 

(21.1.33) 

(21.1.34) 



Now we require H(x,fi) to have a unique curve of zeros passing through 
(x,fi) = (0,0) and existing on both sides of /r = 0. For this it is sufficient 
to have 

f)M r)T^ 

|^(0,0) = 1^(0, 0)^0 (21.1.35) 



and, using (21.1.33), (21.1.35) is the same as 

d^f 



dxd^ 



(0,0) ^0. 



(21.1.36) 



By the implicit function theorem, (21.1.36) implies that there exists a 
unique C’’ function ^(x) (x sufficiently small) such that 



H{x,fi{x)) = F(x,^(x)) — 1 = 0. 



Hence, we require 



d/i 

dx 



(0) ^ 0. 



Implicitly differentiating (21.1.37) gives 



^ If(o-o) If(o.o) ■ 



(21.1.37) 

(21.1.38) 

(21.1.39) 



Using (21.1.33), (21.1.39) becomes 



* Sl(O.O) 



(21.1.40) 



We now summarize the results. A one-parameter family of C’’ (r > 2) 
one-dimensional maps 



X H> f{x,fi), 



X G /a G 



(21.1.41) 
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(21.1.42) 

(21.1.43) 

(21.1.44) 

(21.1.45) 

(21.1.46) 



We remark that the sign of (21.1.40) gives us the slope of the curve of 
fixed points that is not x = 0. In Figure 21.1.5 we show the two cases and 
leave it as an exercise for the reader to compute the possible stability types 
for the different curves of fixed points shown in the figure; see Exercise 
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FIGURE 21.1.6. 

5. Thus (21.1.21) can be viewed as a normal form for the transcritical 
bifurcation. 

We end our discussion of the transcritical bifurcation by graphically 
showing the transcritical bifurcation in Figure 21.1.6 for the map 

X 1-^ X + — x'^; 

cf. the discussion at the end of Section 21.1a. 



21.1c The Pitchfork Bifurcation 

Consider the maps 

X f{x, fj,) = X + fix ^ x^, a; € /i G (21.1.47) 

It is easy to verify that (x, fi) = (0, 0) is a nonhyperbolic fixed point of 
(21.1.47) with eigenvalue 1, i.e., 

/(0,0)=0, (21.1.48) 

df 

^(0,0) = 1. (21.1.49) 

The fixed points of (21.1.47) are given by 

f{x, fi) — X = fix ^ x^ = 0. (21.1.50) 

Thus, there are two curves of fixed points passing through the bifurcation 
point, 
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X 






(a) 

X 






(b) 

FIGURE 21.1.7. a) f{x, fi) = x + fix — , h) f{x, fi) = x + fix + x^ . 

x = Q (21.1.51) 

and 

fi = ±x^ (21.1.52) 

We illustrate the two cases in Figure 21.1.7 and leave it as an exercise 
for the reader to verify the stability types of the different branches of fixed 
points shown in this figure. We refer to this type of bifurcation as a pitchfork 
bifurcation for maps. 

We now seek general conditions for a one-parameter family of C’’ (r > 3) 
one-dimensional maps to undergo a pitchfork bifurcation, i.e., 

in the x — fi plane the map has two curves of fixed points passing 
through the bifurcation point; one curve exists on both sides of /r = 0 
and the other lies locally to one side of /r = 0. 

Consider a C’’ (r > 3) map 

xi-^f{x,fi), a: G /t G (21.1.53) 





with 



/(O.O) =0 

ao>.o)=i 



nonhyperbolic fixed point. 



(21.1.54) 
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The fixed points of (21.1.53) are given by 

f{x,fj,) — x = h{x,^) = 0. (21.1.55) 

Henceforth, the discussion is very similar to the discussion of the pitchfork 
bifurcation for vector fields (see Section 20.1). In order to have more than 
one curve of fixed points passing through (x,/i) = (0,0), we must have 

Oh 0 f 

^(0,0)= ^(0,0) = 0. (21.1.56) 



Since we want one curve of fixed points to be x = 0, we take (21.1.55) of 



the form 


h{x, fj.) = xH{ 


x,fi) = x{F{x,fi) - 


-1), 


(21.1.57) 


where 


H{x,fi) = ■ 


iiiro,M), 


X yf 0 

X = 0 


} 


(21.1.58) 


and, hence. 


F{x,^J,) = 1 




X yf o') 

X = 0 J 




(21.1.59) 



Since we want only one additional curve of fixed points to pass through 
(x,^) = (0,0), we require 



|(((0.0) = 2^(0. 0)^0. 



Using (21.1.59), (21.1.60) becomes 

d^f 

dxd^ 



( 0 , 0 ) ^ 0 . 



(21.1.60) 



(21.1.61) 



The implicit function theorem and (21.1.61) imply that there is a unique 
C’’ function, /r(x) (x sufficiently small), such that 

H{x,fj,{x)) = F(x,/i(x)) — 1 = 0. (21.1.62) 



We require 



and 






dx"^ 



( 0 ) ^ 0 . 



Implicitly differentiating (21.1.62) gives 



(21.1.63) 

(21.1.64) 
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-|f(0,0) -f(0,0) 

f (0-0) f (0-0) ’ 


(21.1.65) 


-0(0,0) -0(0,0) 
f(O-O) f(0,0) ■ 

Using (21.1.59), (21.1.65) and (21.1.66) become 


(21.1.66) 


* iSfe(O.O)’ 


(21.1.67) 


-0(0-0) 

S:(0.0)’ 


(21.1.68) 


To summarize, a one-parameter family of C’' (r > 3) 
maps 

x^f{x,^), a: G /r G 

having a nonhyperbolic fixed point, i.e., 


one-dimensional 

(21.1.69) 


/(0,0)=0, 


(21.1.70) 




(21.1.71) 


undergoes a pitchfork bifurcation at {x,^£) = (0,0) if 




|(0,0,.0. 


(21.1.72) 


f 

^(0-0) = 0, 


(21.1.73) 




(21.1.74) 




(21.1.75) 



Moreover, the sign of (21.1.68) tells us on which side of /i = 0 that one 
of the curves of fixed points lies. We illustrate both cases in Figure 21.1.8 
and leave it as an exercise for the reader to compute the possible stability 
types of the different branches shown in Figure 21.1.8. Thus, we can view 
(21.1.47) as a normal form for the pitchfork bifurcation. 

We end our discussion of the pitchfork bifurcation by graphically showing 
the bifurcation for 

X ^ X + fxx — 

in Figure 21.1.9 in the manner discussed at the end of Section 21.1a. 
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FIGURE 21.1.8. 

a) (-0(O,O)/|^(O,O)) 

b) (-0(O,O)/|^(O,O)) 



> 0 ; 
< 0. 




FIGURE 21.1.9. 

21.2 An Eigenvalue of —1: Period Doubling 

Suppose that our one-parameter family of C’’ (r > 3) one-dimensional 
maps has a nonhyperbolic fixed point, and the eigenvalue associated with 
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the linearization of the map about the fixed point is -1 rather than 1. Up 
to this point the bifurcations of one-parameter families of one-dimensional 
maps have been very much the same as the analogous cases for vector fields. 
However, the case of an eigenvalue equal to -1 is fundamentally different 
and does not have an analog with one- dimensional vector field dynamics. 
We begin by studying a specific example. 



21.2a Example 

Consider the following one-parameter family of one-dimensional maps 

X ^ f{x, y) = —X — iix , a: G /i G (21.2.1) 

It is easy to verify that (21.2.1) has a nonhyperbolic fixed point at (x, /x) = 
(0,0) with eigenvalue —1, i.e., 

/( 0 , 0 )= 0 , ( 21 . 2 . 2 ) 

df 

^(0,0) = -l. (21.2.3) 

The fixed points of (21.2.1) can be calculated directly and are given by 




FIGURE 21.2.1. 

f{x, fa) — X = x{x^ — (2 fi)) = 0. (21.2.4) 

Thus, (21.2.1) has two curves of fixed points, 

x = 0 (21.2.5) 



and 
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x'^ = 2 + ii, (21.2.6) 

but only (21.2.5) passes through the bifurcation point = (0,0). In 

Figure 21.2.1 



we illustrate the two curves of fixed points and leave it as an exercise 
for the reader to verify the stability types for the different curves of fixed 
points shown in the figure. In particular we have 



and 



a; = 0 is 



unstable for ^ < —2, 
stable for — 2 < ^ < 0, 
unstable for fi > 0, 



( unstable for fj, > —2, 

( does not exist for ^ < —2. 



(21.2.7) 



( 21 . 2 . 8 ) 



From (21.2.7) and (21.2.8) we can immediately see there is a problem, 
namely, that for /x > 0, the map has exactly three fixed points and all are 
unstable. (Note: this situation could not occur for one-dimensional vector 
fields.) A way out of this difficulty would be provided if stable periodic 
orbits bifurcated from {x,fi) = (0,0). We will see that this is indeed the 
case. 

Consider the second iterate of (21.2.1), i.e., 

X !->■ /^(x, fi) = X + ^(2 + fj,)x — 2a;^ -|- 0(4). (21.2.9) 



It is easy to verify that (21.2.9) has a nonhyperbolic fixed point at 
(x,/i) = (0,0) having an eigenvalue of 1, i.e.. 



Moreover, 



/"(0,0) = 


0, 


(21.2.10) 


df 


= 1. 


(21.2.11) 


df 


= 0, 


(21.2.12) 


82 f2 


= 2, 


(21.2.13) 


82 f2 

afO’O*- 


= 0, 


(21.2.14) 


83 t2 


-12. 


(21.2.15) 
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X 




(b) 



FIGURE 21.2.2. 

a) (-^(0,0)/|^(0,0))>0; 

b) (-^(0,0)/|^(0.0))<0. 

Hence, from (21.1.72), (21.1.73), (21.1.74), and (21.1.75), (21.2.12), 
(21.2.13), (21.2.14), and (21.2.15) imply that the second iterate of (21.2.1) 
undergoes a pitchfork bifurcation at = (0,0). Since the new fixed 

points of p{x,^) are not fixed points of f{x,fj,), they must be period two 
points of f{x, /i). Hence, f{x, fj,) is said to have undergone a period- doubling 
bifurcation at (a:,/i) = (0,0). 



21.2b The Period-Doubling Bifurcation 

Consider a one-parameter family of C’' (r > 3) one-dimensional maps 

xi-^f{x,fi), X € /X G (21.2.16) 

We seek conditions for (21.2.16) to undergo a period-doubling bifurca- 
tion. The previous example will be our guide. It should be clear from the ex- 
ample that conditions sufficient for (21.2.16) to undergo a period-doubling 
bifurcation are for the map to have a nonhyperbolic fixed point with eigen- 
value — 1 and for the second iterate of the map to undergo a pitchfork 
bifurcation at the same nonhyperbolic fixed point. To summarize, using 
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FIGURE 21.2.3. 

(21.1.70), (21.1.71), (21.1.72), (21.1.73), (21.1.74), and (21.1.75), it is suf- 
ficient for (21.2.16) to satisfy 



/( 0 , 0 )= 0 , 






dp 

dx'^ 

dxd^j, 

W 

dx^ 



( 0 , 0 ) = 0 , 
( 0 , 0 ) = 0 , 
(0,0) ^0, 
( 0 , 0 )^ 0 . 



(21.2.17) 

(21.2.18) 

(21.2.19) 

( 21 . 2 . 20 ) 
( 21 . 2 . 21 ) 
( 21 . 2 . 22 ) 



Moreover, the sign of ^ j ^ which side of 

/X = 0 the period two points lie. We show both cases in Figure 21.2.2 and 
leave it as an exercise for the reader to compute the possible stability types 
for the different curves of fixed points shown in the figure; see Exercise 7. 



Finally, we demonstrate graphically the period-doubling bifurcation for 

X !—>■ —X — /J.X + x^ = f{x, /i) 

and the associated pitchfork bifurcation for p{x,ii) in the graphical man- 
ner discribed at the end of Section 21.1a in Figure 21.2.3. 
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21.3 A Pair of Eigenvalues of Modulus 1: The 
Naimark-Sacker Bifurcation 

This section describes the map analog of the Poincare- Andronov-Hopf bi- 
furcation for vector fields but with some very different twists. Although this 
bifurcation often goes by the name of “Hopf bifurcation for maps,” this is 
misleading because the bifurcation theorem was first proved independently 
by Naimark [1959] and Sacker [1965]. Consequently, we will use the term 
“Naimark-Sacker bifurcation.” 

We know that in this situation the study of the dynamics of (21.0.1) near 
the fixed point (j/Qj Aq) S M" x can be reduced to the study of (21.0.1) 
restricted to a p-parameter family of two-dimensional center manifolds. We 
assume that the reduced map has been calculated and is given by 

X G p G (21.3.1) 

where we take p = 1. If there is more than one parameter, we consider all 
but one as fixed and denote it as /i. In restricting the map to the center 
manifold, some preliminary transformations have been made so that the 
fixed point of (21.3.1) is given by (x,p) = (0,0), i.e., we have 

/(0,0)=0, (21.3.2) 

with the matrix 

D,f{0,0) (21.3.3) 

having two complex conjugate eigenvalues, denoted A(0), A(0), with 

|A(0)| = 1. (21.3.4) 

We will also require that 

A”(0) yf 1, n=l,2,3,4. (21.3.5) 

(Note: if A(0) satisfies (21.3.5), then so does A(0), and vice versa.) 

We showed in Example 19.3a that under these conditions a normal form 
for (21.3.1) is given by 

z 1 -^ \{^)z + c{fi)z'^z + 0{4), z G C, p G (21.3.6) 

We transform (21.3.6) into polar coordinates by letting 



and obtain 
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|A(Ai)| (rb + 0{r'^)J , 

9^0 + ^(/i) + ^ l^lm (^^1^ j ^ + 0{r^), (21.3.7) 

where 

tan"i (21.3.8) 

27t o:(fj,) 



c(fi) = a(fi) + ioj(fi). (21.3.9) 

We then Taylor expand the coefficients of (21.3.7) about /i = 0 and obtain 



( 1+ ^|A(fi)l 



/r r + Re 



^—0 



fm] 

[mo)J 






01-^0 + (()( 0 ) + 

+C>(M^M^■^r3), 



/J.—0 






‘A(0) 



(21.3.10) 



where we have used the condition that |A(0)| = 1. Note that, since A"(0) yf 
1, where n = 1, 2, 3,4, from (21.3.8) we see that ((>(0) yf 0. We simplify the 
notation associated with (21.3.10) by setting 





-•e) 

c/)o = c/>(0), 



fi—0 



IJ-—0 



. _ 1 T »(“) 



hence, (21.3.10) becomes 

r 1-4 r + {d^ + ar‘^)r + 0{^^r, , r'^), 

0 1-^ 0 + 4>o + (j>i^ + br"^ + 0(^^, r^). (21.3.11) 

We are interested in the dynamics of (21.3.11) for r small, /x small. Our 
strategy for understanding this will be the same as in our study of the 
Poincare- Andronov-Hopf bifurcation for vector fields (cf. Section 20.2); 
namely, we will study the dynamics of (21.3.11) with the higher order terms 
neglected (i.e., the truncated normal form) and then try to understand how 
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the dynamics of the truncated normal form are affected by the higher order 
terms. 



The truncated normal form is given by 



r e- >■ r + {dfi + ar^)r, 

e^9 + (j)o + <l)iti + br^. (21.3.12) 

Note that r = 0 is a fixed point of (21.3.12) that is 



and 



asymptotically stable 

unstable 

unstable 



for dfx < 0, 
for dfx > 0, 
for ^ = 0, a > 0, 



asymptotically stable for ^ = 0, a < 0. 



We recall our study of the truncated normal form for the Poincare- Andro- 
nov-Hopf bifurcation for vector fields (see Section 20.2). In that case, fixed 
points of the r component of the truncated normal form, with r > 0, corre- 
sponded to periodic orbits. Something geometrically (but not dynamically) 
similar happens for maps also. 



is a circle which is in- 
variant under the dynamics generated by (21.3.12). 

Proof: That this set of points is a circle is obvious. The fact that it is 
invariant under the dynamics generated by (21.3.12) follows from the fact 
that the r coordinate of points starting on the circle do not change under 
iteration by (21.3.12). □ 



Lemma 21.3.1 < (r,9) € 



X S'! 



— fid 



It should be clear that the invariant circle can exist for either /i > 0 
or fj, < 0 depending on the signs of d and a and that there will be only 
one invariant circle at a distance from the origin. Stability of the 

invariant circle is determined by the sign of a. This is a new concept of 
stability not previously discussed in this book, namely, the stability of an 
invariant set. Its meaning, hopefully, is intuitively clear. The invariant circle 
is stable if initial conditions “sufficiently near” the circle stay near the circle 
under all forward iterations by (21.3.12). It is asymptotically stable if the 
points actually approach the circle. We summarize this in the following 
lemma. 



Lemma 21.3.2 The invariant circle is asymptotically stable for a < 0 and 
unstable for a > 0. 
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Proof: Since the r component of (21.3.12) is independent of 9, this problem 
reduces to the study of the stability of a fixed point of a one-dimensional 
map (i.e., the 6 dynamics are irrelevant). We leave the details as an exercise 
for the reader. □ 





n<0 



^i»o 



^>0 



FIGURE 21.3.1. d > 0, a > 0. 





H <0 ^ 1=0 n >0 



FIGURE 21.3.2. d > 0, a < 0. 



We now describe the four possible cases for the bifurcation of an invariant 
circle from a fixed point. 

Case 1: d > 0, a > 0. In this case, the origin is an unstable fixed point for 
fj, > 0 and an asymptotically stable fixed point for /i < 0 with an unstable 
invariant circle for /i < 0; see Figure 21.3.1. 
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Case 2: d > 0, a < 0. In this case, the origin is an unstable fixed point for 
fj, > 0 and an asymptotically stable fixed point for /i < 0 with an asymp- 
totically stable invariant circle for /i > 0; see Figure 21.3.2. 




FIGURE 21.3.3. d < 0, a > 0. 







L. 

|i<0 ^L>0 




FIGURE 21.3.4. d < 0, a < 0. 

Case 3: d < 0, a > 0. In this case, the origin is an asymptotically stable 
fixed point for /r > 0 and an unstable fixed point for /x < 0 with an 
unstable invariant circle for /i > 0; see Figure 21.3.3. 

Case 4: d < 0, a < 0. In this case, the origin is an asymptotically stable 
fixed point for /i > 0 and an unstable fixed point for /x < 0 with an 
asymptotically stable invariant circle for /x < 0; see Figure 21.3.4. 



We make the following general remarks. 
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Remark 1. For a > 0, the invariant circle can exist for either /j, < 0 (Case 
1) or /X > 0 (Case 3) and, in each case, the invariant circle is unstable. 
Similarly, for a < 0, the invariant circle can exist for either /x < 0 (Case 4) 
or /i > 0 (Case 2) and, in each case, the invariant circle is asymptotically 
stable. Hence, the quantity a determines the stability of the invariant circle, 
but it does not tell us on which side of /x = 0 the invariant circle exists. 

Remark 2. Recall that 

Hence, for d > 0, the eigenvalues cross from inside to outside the unit 
circle as /x increases through zero and, for d < 0, the eigenvalues cross 
from outside to inside the unit circle as ^ increases through zero. Thus, 
for d > 0, it follows that the origin is asymptotically stable for /x < 0 and 
unstable for /x > 0. Similarly, for d < 0, the origin is unstable for /x < 0 
and asymptotically stable for /x > 0. 

At this point the reader is probably struck by the similarities between 
the analysis of the truncated normal form (21.3.12) and the analysis for 
the normal form associated with the Poincare- Andronov-Hopf bifurcation 
(see Section 20.2). However, we want to stress that the situation for maps 
is fundamentally different indeed from this and from all other bifurcations 
we have studied thus far. In all other bifurcations (either in vector fields 
or maps) the invariant sets that are created consist of single orbits while, 
in this case, the bifurcation consists of an invariant surface (i.e., a circle) 
which contains many different orbits. We can study the dynamics on the 
invariant circle by studying the dynamics of (21.3.12) restricted to the 
invariant circle (i.e., by considering only initial conditions that start on the 
invariant circle). Points on the invariant circle have initial r coordinates 
given by 




so that the associated circle map is given by 

0 1—^ 9 (j)Q ^ /X. (21.3.13) 

The dynamics of (21.3.12) are easy to understand and depend entirely on 
the quantity (po + {(pi — ^)/x- If 4>o + {(pi — f)fJ, is rational, then all orbits on 
the invariant circle are periodic. If (po + {(pi — y)/x is irrational, then all orbits 
on the invariant circle densely fill the circle. We proved these statements in 
Chapter 10, Theorem 10.4.2. Thus, as /x is varied, the orbit structure on the 
invariant circle continually alternates between periodic and quasiperiodic. 
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Although all of this analysis is for the truncated normal form (21.3.12), 
our real interest is in the full normal form (21.3.11). For this we have the 
following theorem. 



Theorem 21.3.3 (Naimark-Sacker bifurcation) Consider the full nor- 
mal form (21.3.11). Then, for /i sufficiently small, Cases 1, 2, 3, and f 
described above hold. 



Proof: The proof requires more technical machinery than the proof of the 
Poincare-Andronov-Hopf bifurcation for vector fields. This should come 
as no surprise, since the Poincare-Bendixson theorem and the method of 
averaging (do not immediately apply to maps. For this reason, we will not 
state this proof here; excellent expositions of the proof may be found in, 
e.g., looss [1979]. □ 



We must take care to interpret this theorem correctly. Roughly speaking, 
it tells us that the “tail” of the normal form does not qualitatively affect 
the bifurcation of the invariant circle exhibited by the truncated normal 
form. However, it tells us nothing about the dynamics on the invariant 
circle. Indeed, we would expect the higher order terms of the normal form 
to have a very important affect on the circle map associated with the in- 
variant circle of the full normal form (21.3.11). This is because the circle 
map associated with the invariant circle of the truncated normal form is 
structurally unstable. 



21.4 The Codimension of Local Bifurcations of 
Maps 



The degeneracy of fixed points of maps can be specified by the concept 
of codimension as described in Section 20.4. The codimension of a case is 
computed using exactly the same procedure (cf. Definition 20.4.6). Indeed, 
for a C” vector field 

X = f{x), X G M", (21.4.1) 

we are interested in the local structure of 

/(x) = 0, (21.4.2) 



and, for a C” map. 



X g{x), 



a: G K” 



(21.4.3) 
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we are interested in the local structure of 



g{x)-x = 0. (21.4.4) 

Thus, it should be clear that, mathematically, (21.4.1) and (21.4.3) are 
the same. We will, therefore, state the results for maps only and leave the 
verification to the reader. 



21.4a One-Dimensional Maps 

The maps 

X ^ X + ax^ + 0{x^), (21.4.5) 

X ^ —X + ax^ + 0{x‘^), (21.4.6) 

have nonhyperbolic fixed points at a; = 0. Using the techniques developed 
in Chapter 20, Section 20.4b it is easy to see that these are codimension 
one fixed points with versal deformations given by 

a; I— >■ x + /i =F , (21.4.7) 

xe^—x + gx^x^. (21.4.8) 

Hence, the generic bifurcations of fixed points of one-parameter families of 
one-dimensional maps are saddle-nodes and period-doublings. 

Similarly, the map 

X ^ X + ax^ + 0{x^) (21.4.9) 

has a nonhyperbolic fixed point at a; = 0. It is easy to show that a; = 0 is 
a codimension two fixed point, and a versal deformation is given by 

X 1-^ X + fii + T ■ (21.4.10) 



21.4b Two-Dimensional Maps 



Suppose we have a two-dimensional map having a fixed point at the origin 
with the two eigenvalues of the associated linear map being complex con- 
jugates with modulus one. We denote the two eigenvalues by A and A. One 
can assign a codimension to this nonhyperbolic fixed point using the meth- 
ods of Section 20.4; the number obtained will depend on A and A (we will do 
this shortly). We can then construct a candidate for a versal deformation 
using the same methods. However, these will not give versal deformations. 
One of the obstructions to this involves the dynamics on invariant circles. 
As we have seen earlier (see Section 21.3), all of the higher order terms in 
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the normal form may affect the dynamics on the invariant circle. Neverthe- 
less, we will give the codimension and the associated parametrized families 
using the techniques of Section 20.4 for the different cases. 

The case 

n=l,2,3,4, (21.4.11) 

is codimension one. The associated one-parameter family of normal forms 
for this bifurcation is 

z ^ {1 + p)z + cz^z, zGC (21.4.12) 

(see Example 19.3a). If we ignore the dynamics on the invariant circle, then 
(21.4.12) captures all local dynamics. 

The cases ruled out by (21.4.11) are referred to as the strong resonances. 
The cases n = 3 and n = 4 are codimension one with associated one- 
parameter families of normal forms given by 

z ^ {1 + fi)z + ciz“^ + C 2 z“^z, n = 3, z G C, ^ G (21.4.13) 

z 1 -^ {I + fx)z + ciz^ + C 2 z'^z, n = 4, z G C, ^ G (21.4.14) 

The cases n = 1 and n = 2 correspond to double 1 and double -1 eigenval- 
ues, respectively. If the matrices associated with the linear parts (in Jordan 
canonical form) are given by 

(I \y n = l, (21.4.15) 



and 

n = 2, (21.4.16) 

then these cases are codimension two with associated two-parameter fami- 
lies of normal forms given by 



X ^ X + y, 

y ^ jj,i + y2X + y + ax"^ + bxy, 



(21.4.17) 



X X + y, 

y I— >■ nix + y,2y + y + ax^ + bx^y 



(21.4.18) 



Arnold [1983] studied (21.4.13), (21.4.14), (21.4.17), and (21.4.18) in detail 
by using the important local technique of interpolating a discrete map by 
a ffow. We will work out some of the details in the exercises. The strongly 
resonant cases arise in applications in, for example, the situation of applying 
a periodic external force to a system which would freely oscillate with a 
given frequency. Strong resonance would occur when the forcing frequency 
and natural frequency were commensurate in the ratios 1/1, 1/2, 1/3, and 
1/4. The surprising fact is that, in the cases n = 1 and n = 2, chaotic 
motions of the Smale-horseshoe type (see Gambaudo [1985]) may arise. 
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21.5 Exercises 



1. Verify the stability for the branches of fixed points of the maps (21.1.4) shown in 
Figure 21.1.1. For one dimensional maps (as opposed to one dimensional vector fields) 
period doubling bifurcations are possible, so this may limit the range in for which 
the stabilities indicated in Figure 21.1.1 are valid. 

2. Verify the stability for the branches of fixed points of the maps (21.1.21) shown in 

Figure 21.1.4. For one dimensional maps (as opposed to one dimensional vector fields) 

period doubling bifurcations are possible, so this may limit the range in for which 
the stabilities indicated in Figure 21.1.4 are valid. 

3. Verify the stability for the branches of fixed points of the maps (21.1.47) shown in 

Figure 21.1.7. For one dimensional maps (as opposed to one dimensional vector fields) 

period doubling bifurcations are possible, so this may limit the range in for which 
the stabilities indicated in Figure 21.1.7 are valid. 

4. Consider the saddle-node bifurcation for maps and Figure 21.1.2. For the case 
(-|^(0.0)/|^(0,0)) > 0, give conditions under which the upper part of the curve 
of fixed points is stable and the lower part is unstable. Alternatively, give conditions 
under which the upper part of the curve of fixed points is unstable and the lower part 
is stable. 

Repeat the exercise for the case (0, 0)/ ^ (0, 0)^ < 0. 

5. Consider the transcritical bifurcation for maps and Figure 21.1.5. For the case 
{-1^(0, 0)/ ^{0,0)) > 0, give conditions for a: — 0 to be stable for /a > 0 and 
unstable for /a < 0. Alternatively, give conditions for x = 0 to be unstable for // > 0 
and stable for /x < 0. 

Repeat the exercise for the case (0, 0)/ (0, 0)^ < 0. 

6. Consider the pitchfork bifurcation for maps and Figure 21.1.8. For the case 
(-|^(0.0)/j|^(0.0)) > 0, give conditions for x — 0 to be stable for /x > 0 and 
unstable for /x < 0. Alternatively, give conditions for x = 0 to be unstable for /x > 0 
and stable for /x < 0. 

Repeat the exercise for the case (0, 0)/ (0, 0)^ < 0. 

7. Consider the period-doubling bifurcation for maps and Figure 21.2.2. For the case 

( q 3 f2 p.2 f2 \ 

— ^3 (0; 0)/ (0; 0) j > 0, give conditions for the period one points, x = 0, to be 
stable for /x > 0 and unstable for /x < 0. Alternatively, give conditions for x = 0 to be 
unstable for /x > 0 and stable for ^ < 0. 

Repeat the exercise for the case (0, 0)/ (0, 0)^ < 0. 

8. In Exercise 6 in Chapter 18 we computed center manifolds near the origin for the 
following one-parameter families of maps. Describe the bifurcations of the origin. In, for 
example, a) and a^) the parameter s multiplies a linear and nonlinear term, respectively. 
In terms of bifurcations, is there a qualitative difference in the two cases? What kind 
of general statements can you make? 



V ^ 2 

X 1-^ X — y — xy , 

1 2 

y — — X ey X , 

1 2 

X 1-^ ~ 2^ ~ y ~ 5 

y i-j- -iy -h -k x^. 



(x,y) G 
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b) 

b') 

c) 

c') 

d) 

d') 

e) 

e') 

f) 

f') 



■ x + 2y + X , 

■ 2x + y + ey, 

■ X + 2y + x^ , 

■ 2x + y + sy^ . 

■ —X + y — xy'^ , 

■ y + sy + x'^y, 

■ —X + y — xy'^ , 

■ y + ey^ + x^y. 

■ 2x + }/, 

■ 2x + 3y + 5x + x^ 

■ 2x + y + ex ^ , 

■ 2x + 3y + 

■ X ey, 

■ x + 2y + y^ , 

■ X -\- ey , 

■ x + 2y + y^ . 

■ 2x + 3y, 

■ X ey x“^ xy'^ , 

■ 2x + 3y, 

■ X + x"^ + ey^ + xy^ . 



(x,y) e 1 



(x, y) e 1 



{x, y) e ; 



{x,y) e 1 



(x, y) e 1 



■ X — z , 

■ 2x — y ey, 

I 1 I 3 

X -\ Z X , 

2 



X X — z 



h) 



X X ex z 



h') y' 

Z h 

i) 

y ' 

y ' 

j) 

y » 
j') 



{x, y, z) G 1 



■ 2a; — y + ey , 

I 1 I 3 

X -\- —Z X . 



■ X ez , 

r> 3 

—X — 2y — X , 

1.2 

y - -2 + y > 



{x, y,z) G] 



(^>y) ^ • 



—X — 2y — X , 
1.2 
y - -2 + y • 

■ y -\- ex -\- x"^ , 

■ y + xy, 

■ y-\- X , 

■ y -\- xy + ex . 



(^.y)6 5 

■ y + a:y, 

2 I 

• £c + ey, 

■ y + xy. 



9. Center Manifolds at a Saddle-node Bifurcation Point for Maps 

Following the discussion in Exercise 7 following Section 20.1, develop the center man- 
ifold theory for maps so that it applies at a saddle-node bifurcation point. 

Apply the resulting theory to the following maps. In each case, compute the center 
manifold and describe the dynamics near the origin for e small. Discuss the bifurcations 
that occur (if any) at e = 0. 
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. I I 2 2 

X X e X X — y 1 
,2,2 
y X + y , 

V I .22 

IN. X e + sx -\- X — y , 
DJ .3,2 

y ^ X +y . 

V I I 2 2 

a^i-^e + ai + a: — y ■, 
y e x^ y^ . 

1 2 

a; I— >• e H — X — y — X , 

1^2 
y '->■ -2: + y ■ 

>, xi-¥e — x-\-y-\-x^, 

2 

y y £x — X . 

cN. a: 1-^ e + ai + eai + , 

. I o 2 

y I— >• a; + 2y — ai . 

a:; 1-^ e - rc + a:;y + y^, 

V o 2 

y 2x — xy — y . 



(ai, y, e) G 



xi-^e-\-2x-\-y + x^y, 
y I-4- 12a: + 3y — xy'^ . 



10. For the Naimark-Sacker bifurcation, compute the expression for the coefficient a anal- 
ogous to (20.2.14) for vector field. {Hint: the answer can be found in Guckenheimer 
and Holmes [1983]). 

11. Consider the following (r > 1) two-dimensional, time-periodic vector field 



X ^ y, 
V = 



(x,y) e R^, 



where f(x, t) has period T in t. 

a) Show that the vector field has a (time-dependent) first integral and that the 
first integral is actually a Hamiltonian for the system. 

b) Suppose that the vector field is (constantly) linearly damped as follows 

^ ^ V' A \ n 

y = -<5y + f{x, t), 



Show that the associated Poincare map cannot undergo Naimark-Sacker bifur- 
cations. 



12. Consider a map of having a fixed point at the origin where the eigenvalues associated 
with the linearized map a^ complex conjugate and of unit modulus. We denote the 
two eigenvalues by A and A. The goal of this exercise is to study the dynamics near 
the origin in the cases 

A^ ^ 1, q ^ 1,2, 3, 4. 

We will begin by developing a very powerful local technique; namely, interpolating a 
map by a flow. 

a) Prove the following lemma (Arnold [1983]). 

Consider a (r as large as necessary) mapping /:R^ R^ having 
a fixed point at the origin with the eigenvalues of the linearization 
at the origin given by (and with a Jordan block of order 

2 if q — 1 or 2). In a sufficiently small neighborhood of the origin 
the iterate f^ can be represented as follows 

f^(z) = (pi{z) + 0{\z\^), z e R^, 
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Next 

b) 



c) 



Now 

d) 



where <pi{z) is the time one map obtained from the flow generated 
by a vector field, v{z). Moreover, the vector field is invariant under 
rotations about the origin through the angle 2Tzfq. 

(Hint: first put f^ in normal form 

/^ — Az + F2 ( 2 ) + F 3 ( 2 ) + - - - + + 0 {\z\ ). 

Next consider the vector field 

2 = (A — id )2 + F2 (z) + F3 ( 2 ) + - - - + Fj^_^{z).) 

Approximate the time one map of this vector field via Picard iteration (justify 
the use of this method), and show that it gives the desired result. This will show 
why it is first necessary to put the map in normal form. Moser [1968] gives an 
alternate proof of this lemma as well as a nice discussion of interpolation of 
maps by fiows. 



we must deal with versal deformations of these maps. 

Prove the following lemma (Arnold [1983]). 

Consider a deformation f\, X G of a mapping fQ — f satisfy- 
ing the hypotheses of the lemma in Part a). In a sufficiently small 
neighborhood of the origin, the iterate f^ can be represented as 

fl(z) = + 0{\z\^), zGK^, 

where (pi^\{z) is the time one map obtained from the flow gener- 
ated by a vector field, v\, that is invariant under rotations about 
the origin through the angle 2T^jq. Moreover, (pi^o{z) — <pi{z) and 
Vo{z) = v{z). 

{Hint: this lemma is a consequence of the fact that the normalizing transforma- 
tions, up to order N , depend differentiably on the parameters.) 

A vector field on the plane may have 

1. Fixed points (hyperbolic and nonhyperbolic) . 

2. Periodic orbits. 

3. Homoclinic orbits. 

4. Heteroclinic orbits. 

Suppose the time one map of a vector field having all of these orbits approximates 
the map in the sense of the lemmas in Part a) and b). How would each of 
these orbits be affected by the higher order terms (i.e., the 0 (| 2 |^) terms)? 



TO return to the main problem. 

Show that the normal forms in the cases = 1, q — 

^ ' y y F F bxy -h • • • , 

X X F y ^ , 

q — , I 3 I , 2 , 

y y F dx F bx y F ■ ■ ■ , 

q — 3 : z z F ci'z^ F 022^2 , 

q — z z F ci 2 ^ -|- C 22^2 



1, 2, 3, 4, are given by 
(x,y) G R^. 

(x,y) G R^. 
z G C. 
z G C. 



Compute the codimension in each case. Argue that candidates for versal defor- 
mations are given by 



g = 1: 
q = 2: 

q = 3: 
q = 4- 



X X F P', 

y 1-^ y.1 + y.2V + y + ax^ -|- bxy, 



(x, y) e : 



X 1-^ x + y, (\cz- 

y ^ fiix + {I + y.i)y + ax^ + bx'^ y , \^’V) 

z G C. 



(1 + fl)z + CiZ^ -t- C 2 Z^Z, z G C. 
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g) 

h) 



Show that the vector fields that interpolate through the order given in Part 
e) are 



x = y, ^ 

V = + ax + bxy, 



q=l-. 
q = 2-. 

O' — 3: z — fiz + c\z^ + C 2 Z^^ 



X — y. 

^ ’ 3 2 

y = fj.ix + fi 2 y + ax + bx y 



(x,y) G 
(x, y) G 1 

z G C. 



q — A\ Z — flZ + CiZ^ + C22"^2, 



2 G ( 



Describe the complete dynamics of each of the vector fields in Part f). 

Using the results from Parts g) and c), describe the dynamics of near the 
origin for q — 1,2, 3, and 4. 



We remark that the results of this problem were first obtained by Arnold [1977], [1983]. 
A very interesting application to a vector field undergoing a Poincare-Andronov— Hopf 
bifurcation that is subjected to an external time-periodic perturbation can be found 
in Gambaudo [1985]. 



13. 



Consider the following (r as large as necessary) one-parameter family of vector 
fields 

X = f(x, fi), {x, fi) ^ X . 

Suppose this vector field has a fixed point at — (0,0). 



a) Suppose n = 1; can (ic,/j.) = (0,0) undergo a period-doubling bifurcation? 

b) Suppose n — 2‘, can {x,^) — (0,0) undergo a period-doubling bifurcation? 

c) Suppose n = 3; can {x,^) — (0,0) undergo a period-doubling bifurcation? 

{Hint: consider a linear vector field 

X — Ax, X G , 



where the flow is given by 

At 

X — e 

and det > 0 for finite t. Use these facts.) 



^ 0 , 



21.6 Maps of the Circle 



As we saw in the previous section, the Naimark-Sacker bifurcation gave rise 
to an invariant circle. Hence, the restriction of the map to the invariant 
circle gives rise to a circle map. In this section we want to establish some 
of the basic properties of circle maps. First, we establish the setting and 
notation. 

We consider orientation-preserving homeomorphisms of the circle, S^, 
into itself: 

/ : M ^ S\ 

Our study of circle maps will be aided through the notion of a lift. This 
device will enable us to consider the circle map as defined on the real line 
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(constructed from the periodic repetition of the unit interval), where issues 
such as differentiability and order preservation are straightforward. We now 
define the lift of a circle map. 

Definition 21.6.1 (The Lift of a Circle Map) Consider the following 
map of the real line M to : 

X = e. 



The map F 






UoF= foU. 



Through the lift, the property of orientation preservation is manifested by 
F being an increasing function of x, i.e, x\ > X 2 implies F(xi) > F{x 2 )- 



Some authors choose to work with a circle of length one, others choose to 
work with a circle of length 27 t. The difference involves making sure the 
appropriate factors of 27 t are in the right places in the various formulae. 
For example, in the map 11 used in the definition of the lift the factor 
27 t in the exponent of the exponential implies that we are working on a 
circle of length one. 



We now consider some examples of lifts. 

Example 21.6.1. Consider the circle map 

f-.S^^ S\ 

6 — > 6 + 2-kijJ. 



Then, a lift F of the circle map / satisfies 

/ TT 2'Ki(x-\-u}') TT m 27rif(x) 

oIl = e ^ ^=rioF = e 

from which it follows that, 

F(x) = X + ui + k, where k is some integer. 



End of Example 21.6.1 
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Example 21.6.2. Consider the circle map 

S\ 

6 — > 9 + £sin9. 



Then, a lift F of the circle map / satisfies 

/ TT 27 ri( j;+ sin 27 ra: ) -nr m 27 riF'(x) 

oil = e / = n o F = e ^ 

from which it follows that, 

F(x) = X + sin27ra; + k, where k is some integer. 



End of Example 21.6.2 



A key quantity in describing the dynamics of circle maps is the rotation 
number associated with a given circle map. We now develop a series of 
lemmas that will be used in proving some of the important properties of 
the rotation number. 

The following lemma shows that two lifts of the same circle map differ 
by an integer. 



Lemma 21.6.2 Let f : be an orientation preserving homeo- 

morphism of the circle and let Fi and F 2 be lifts of f. Then Fi = F 2 + k, 
where k is some integer. 

Proof: The two lifts must satisfy 

/on = noFi = 

/on = noF2 = 



from which it follows immediately that Fi = F 2 + k, where k is some 
integer. □ 

The following lemma shows that the iterate of the lift of a circle map is 
the lift of the iterate of the circle map. 
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Lemma 21.6.3 If F is a lift of f then is a lift of n > 1. 

Proof: By definition of a lift we have 

UoF= foU, 

and therefore 

noFoF = noF2 = /onoi^ = /o/on = /^on. 

Repeating this argument establishes the result for arbitrary n. □ 



Lemma 21.6.4 Let f : be an orientation preserving homeo- 

morphism of the circle and let F be a lift. Then 

F{x + k) = F{x) + k 



for any integer k. 



Proof: We first consider the case k = 1. 

Using the definition of the lift we have 

^27,iF{x+i) = n o F(x + 1 ) = / o n(x + 1 ) = / o n(x) = n o f{x) = . 

Thus, 

^2-KiF{x-\-l) ^2-KiF{x) 

which implies that 

F{x+ 1) = F{x) +j, 

where j is some integer. We now argue that j = 1. 

We give a proof by contradiction that j = 1. Suppose that j > 1. Then 
since F(x) is an increasing function there must exist a point, y, with x < 
y < X + 1, such that 

F{y) = F{x) + m, 1 < m < j. 



( 21 . 6 . 1 ) 

( 21 . 6 . 2 ) 

(21.6.3) 



Then 

/ O n(y) = = / o n(x). 
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from which it follows that 

y = X + m, 

where m is some integer ^ 0. This contradicts our choice of y. 

The case for a general integer k is proved similarly. We can replace a: + 1 
by x + k in (21.6.1). Then we can conclude that F{x + k) = F{x)+j, where 
j is some integer. Applying the same argument allows us to conclude that 
j = k. □ 



Lemma 21.6.5 Let f : be an orientation preserving homeo- 

morphism of the circle and let F be a lift of f. Then F" — id is a periodic 
function with period one, n>l. 



Proof: First prove the lemma for n = 1. Using Lemma 21.6.4, we have 
F{x + 1) — (a; + 1) = F{x) — x. 

This proves the lemma for n = 1. 

Now consider the case n > 1. By the chain rule, it follows that /” is also 
an orientation preserving homeomorphism of the circle, and by Lemma 
21.6.3 F" is a lift of /". Then using Lemma 21.6.4, it follows that 

F"(x + 1) - (a; + 1) = F”(a;) - a:. 

This completes the proof of the lemma. □ 



Lemma 21.6.6 Let f be an orientation preserving homeomorphism of the 
circle with lift F. Then there exists an integer kn such that 

kn < F”(a;) — x < + 3, 



for every x G M, n > 1 . 



Proof: By Lemma 21.6.5, since F”(x) — x is periodic with period 1, the 
result will follow if it is established only for x G [0, 1]. 

Now F"(0) is between two integers m and m+1, i.e., m < F"(0) < m+1. 
From Lemma 21.6.4 it then follows that m+ 1 < F"(l) < m + 2. Since F is 
increasing it follows that to — 1 < F"(x) — x < to + 2. If we let = to — 1 
the result follows. □ 
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Lemma 21.6.7 Suppose f : is an orientation preserving home- 

omorphism and F is a lift. Then if\x—y\ < 1, we have |F"(a;) — F”(y)| < 1, 
n > 1. 



Proof: Now |a; — y| < 1 implies that — 1 < a; — y < 1, or 



1. a: < 1 + y, 

2. y < 1 + a;. 



Using Lemma 21.6.4, and the fact that F is increasing, we apply F" to 
each of these expressions to obtain 

1. F"(x) < F”(l + y) = F"(y) + 1 ^ F”(a;) - F"(y) < 1, 

2. F"(y) < F"(l + x) = F”(x) + 1 ^ -1 < F”(x) - F”(y). 



Which establishes the desired result. 



□ 



We are now at the point where we can develop the definition of rotation 
number. For an orientation preserving homeomorphism / : — > S^, with 

F a lift of /, we define the quantity 



Po{F) = lim 

n—^oo 



n 



First, we show that if this quantity exists, it is independent of x. 



Lemma 21.6.8 If pq{F) exists, then it is independent of x. 



Proof: Let x and y be given points. Then y can be written in the form 



y = y + fc. 



where k is an integer and |x — y| < 1. Using this representation for y, we 
have 

|F"(x) - F"(y)| = |F"(x) - F"(y + k)\ = |F"(x) - F"(y) - k\, 

< |F"(x)-F"(y)| + fc. 
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Using Lemma 21.6.7, since |a; — y| < 1 then |U”(x) — U”(y)| < 1. So we get 
|F”(a;) - F"(y)| ^1 + k 



and therefore 



n 



\F-{x)-F^{y)\ ^ ^ 

n—^oo fi 



This completes the proof. 



□ 



Example 21.6.3. Consider the circle map 

f :S^ — > 

6 — > 6 + 27rt<; 



with lift 



F -.M} — > 

X — > a: + w + fc. 



Then we have 



Po(E) 



X -\- nto -\- nk 

lim = u + K. 

n—¥oo tl 



End of Example 21.6.3 



Lemma 21.6.9 Let f : be an orientation preserving homeo- 

morphism and let F\ and F 2 be lifts such that Pq{F\) and ^0(^2) exist. 
Then po{Fi) = po(^2) + k, where k is an integer. 



Proof: From Lemma 21.6.2, 



Fi = F 2 + k. 



Let Tfc denote the translation map, i.e., : x x + k. Then we can 

rewrite this equation as 

Fi = TkO F2. 



Now we want to show that Tk commutes with F 2 . We have 

7^2 o Tfe = F 2 {x + k) = F 2 {x) + k, by Lemma 21.6.4, 
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and 

Tk o ^ 2 ( 3 ;) = F2{x) + k. 



Hence 



F2 oTk = Tk o F2. 



We use this result in the following way. Since F\ =TkO F 2 it follows that 

Ff(a:) = (Tfc o F 2 )” (x), 

= {Tk o F 2 ) o (Tfc o F 2 ) o • • • o {Tk o F 2 ){x), 

^ ■■V" 

n fold composition of Tk o F 2 

= F 2 oT^(x), where we have repeatedly commuted “pairwise,” 

Tk and F 2 
= F^(x + nk), 

= F^lx) + nk, by Lemma 21.6.4. 



Hence 

F” = F^ + nk, 

From which it immediately follows that 

Po{Fi) = p{F2) + k. 

This completes the proof. □ 

We now prove that po{F) exists. 



Theorem 21.6.10 Let f : be an orientation preserving home- 

omorphism with lift F. Then 



Po{F) 



lim 



\F-{^)\ 

n 



exists and is independent of x. 



Proof: We break the proof down into two cases. 



Case 1: 



Suppose / has a periodic point, i.e., there exists 9 £ such that /’"(6*) = 
9. Then 

F™(x) =x+fc, 
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for some fixed x, where k is some integer. Therefore 

= x + jk, 



from which it follows that 



lim 

j->oo 



\F^^{x)\ 

jm 



lim 

j->oo 



X + jk 

jm 



A 

m 



Any integer n can be written in the form 

n = jm + r, 0 < r < m 



and by Lemma 21.6.6 there exists a constant M such that 

\F^{y)-y\<M, 



for all y G M, 0 < r < m. So we have 

I A" (a;) - F3'^{x)\ _ |F''(F^™(a;)) - 

n n 

M 

< — , 

n 



from which it follows that 



,i„ !£!M = 



= lim 



X + jk 



i->oo jm + r jm + r 



A 

m 



So, po{F) exists whenever / has a periodic point, moreover, in this case 
Po{F) is rational. 



Case 2: 



Now we consider the general case. By Lemma 21.6.6 there exists an 
integer such that 

kn < F"'{x) — X < kn + 3, (21.6.4) 

for every cc G M, n > 1. Applying (21.6.4) repeatedly to x = A"(x), J^^”(x), 
...^ jr(m-i)" 

(x), we get the following chain of inequalities 
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kn < F"‘{x) - X < kn + 3, 

kn < F^^{x) - F^{x) <kn + 3, 

kn < F^^{x) - F^^{x) < fc„ + 3, 
kn < F”^"{x) - < fc„ + 3. 



Adding these inequalities gives 



mkn < F™"(x) — X < m{kn + 2), 



or, 



kn F™"(x) -X kn + 3 

n mn n 



The first inequality in the chain of inequalities gives 

K ^ ^ fcn+3 

n n n 

Combining these two expressions gives the estimate 



F™(x) F"(x) 



3 

< - 

mn n \ n 

Repeating the above argument with m and n interchanged gives 



F'^{x) 



3 

< — 
m 



(21.6.5) 



( 21 . 6 . 6 ) 



Combining (21.6.5) and (21.6.6) gives 



F”(a;) F'^{x) 



m 



< 



n + m 



Hence 



the sequence | ^ | is a Cauchy sequence in M, so it converges. 



□ 



Finally, we define the rotation number of an orientation preserving, 
homeomorphism of the circle. 



Definition 21.6.11 (Rotation Number) There are two main opera- 
tional definitions of rotation number. For f : an orientation 

preserving homeomorphism, with F a lift of f: 

1. Some authors define the rotation number of f, denoted p{f), as the 
fractional part of po{F) (e.g. Devaney [1986]). 
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2. Other authors define the rotation number of f to be po{F) (e.g., Katok 
and Hasselblatt [1995]). 



In either case, we have shown that the rotation number exists, and is inde- 
pendent of the point x. 



Next we show that that rotation number depends continuously on / in 
the topology. 



Theorem 21.6.12 Let f : be an orientation-preserving dif- 

feomorphism. Let e > 0 be given, then there exists a (5 > 0 such that if 
g : is also an orientation preserving diffeomorphism which is 

— 6 close to f then \p{f) — p{g)\ < e- 



Proof: Choose n such that f < e- From Lemma 21.6.6, there exists kn such 
that 

kn < i^”(0) < fc„ + 3 

for F some lift of /. Choose S sufficiently small so that for some lift G of g 
we also have 

k„ < G"(0) <kn + 3 

Utilizing the same argument as in Theorem 21.6.10, we obtain the inequal- 
ities 

^ ^ ^ k„ + 3 

n mn n 

kn ^ ^ kn+3 

n mn n 

Combining these inequalities gives 

< _ < g 

mn mn n 



and the result follows since 



Po{F) = lim 

n— >-oo 



F™”(0) 

mn 



and 



po(G) = lim 

n—^oo 



G™(0) 
mn ’ 



and the limit is independent of the point. 



□ 
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Theorem 21.6.13 The rotation number is irrational if and only if f has 
no periodic points. 

Proof: See Devaney [1986]. □ 

Theorem 21.6.14 The rotation number is rational if and only if f has a 
periodic orbit. 



Proof: In our proof of the existence of the rotation number we have shown 
that if / has a periodic point, then the rotation number is rational. It re- 
mains to show that if the rotation number is rational, then / has a periodic 
point. We leave this to the exercises (or see Katok and Hasselblatt [1995]). 

□ 



Here we introduce some terminology. Suppose the rotation number is 
rational, say -, with p and q relatively prime integers. We refer to the 
associated periodic orbit as a “p — q” periodic orbit (or a periodic orbit 
of type “p — q”). In terms of the orbit of the point, it is a period q orbit 
that makes p revolutions around the circle before returning to its original 
starting point. 

The rotation number is invariant under orientation preserving topological 
conjugacy. 



Theorem 21.6.15 Let f and g be orientation preserving homeomorphisms 
of S^, then p{f) = p{g~'^fg). 

Proof: See Katok and Hasselblatt [1995]. □ 



The Rotation Number and Orbits 

While we have seen that an orientation preserving circle homeomorphism 
has periodic orbits if the rotation number is rational, and it has no periodic 
orbits if the rotation number is irrational, we have not characterized all 
possible orbits for the two cases. This is done below, and is referred to as 
the “Poincare classification” by Katok and Hasselblatt [1995], where the 
proof of the statements can be found. 
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Rational Rotation Nnmber, - 

For any given initial condition, there are three possibilities for the orbit. 



• A ^ periodic orbit. 

• A homoclinic orbit. The orbit asymptotically approaches a periodic 
orbit as n — >■ — oo and as n — >■ +oo. 

• A heteroclinic orbit. The orbit asymptotically approaches a periodic 
orbit as n — >■ — oo and a different periodic orbit as n ^ +oo. 



Irrational Rotation Number 



For any given initial condition, there are three possibilities for the orbit. 



• An orbit that densely fills the circle. 

• An orbit that densely fills a Cantor set on the circle. 

• An orbit that is homoclinic to a Cantor set on the circle. 



21.6a The Dynamics of a Special Class of Circle 
Maps- Arnold Tongues 



The results above are rather general. Now we will study a more specific 
class of circle maps and obtain more detailed information on the dynamics. 
In particular, we want to discuss the notion of phase locking and Arnold 
tongues. The following discussion loosely follows Hall [1984]. 

Consider the following two-parameter family of diffeomorphisms of 
the circle: 

0 — {0 + (l) + aj(0)} 

where (•) denotes the fractional part, and we have the following assumptions 
on 7 : — >• 



1. 7 G CC r > 1, 



^7 

dO 



< 1 , 



2. V 6» G M, 7(0+1) = 7(0) + 1 
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1 

3. fj(d)dd = 0. 

0 

We will study the following lift of the above circle map: 
f : 9 — >9 + (l) + aj(9) = f{9, (j), a). 

An example of a map of this type is given by 

a . 

X 1-^ X + (p + — sin ZTTx, 

ZTT 

which is often referred to as the standard map, which was studied in detail 
by Arnold [1965]. 

The two parameters are <p and a. For a = 0 the map is just a rigid, linear 
rotation through the angle (p. The parameter a controls the nonlinearity of 
the map. 

We denote the rotation number of the lift by p{(p,a). 

Theorem 21.6.16 For ^ G M, a G [0, 1) 

1. p{(p,a) exists and is independent of 9. 

2. p{(p,a) is continuous in {(p,a) and nondecreasing in (p. 

3. If p{(p,a) = withp,q relatively prime positive integers, then there 
exists 9 G [0, 1) such that f^{9, (p,a) = 9 + p. 



Proof: 



1. In order for this to be true we need only show that / is orientation 
preserving and increasing, i.e., % > 0- We have 



d9 



{9,<p,a) = 1 + 



Since 



d'y 

~de 



< 1 and a G [0, 1) the result follows. 



2. We have already established that p{(p, a) depends continuously on 
parameters (cf. Theorem 21.6.12). 
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We now show that a) is nondecreasing in <j) for each fixed a. 

Fix a. For <f> 2 > <pi, we have 

f{0, 4>2, a) = 9 + 4)2 + aj(9) > f{0, 4>i,a) = 9 + 4>i + a"f{9). 
Then 

r{0,4)2,a)> r{9,4>i.a), 

from which it follows that 

p{4>2,a) > p{4)i,a). 

3. Half of this result was established in Theorem 21.6.10, the other half 
was left as an exercise. 

□ 



The following lemma will be useful. 

Lemma 21.6.17 For any integer q, 

q-l 

P{9, 4>,a) = 0 + q4> + a ^ 7 ( 6 * + j4>) + ah{0, 4>, a) 
j=o 

where h is as differentiable as 7 and — >■ 0 as a — >■ 0. 



Proof: This just involves a direct calculation while introducing the correct 



notation. 




( 7 = 1 : 


f{ 0 , 4 ),a) = 0 + (/) + 7(0), 


(7 = 2: 


/^(0, 4>,a) = 9 + 24> + 07(0) + 07(0 + 4> + 07(0)), 


q = 3 : 


/^(0, 4,ct) = 0 + 34 + 07(0) + 07(0 + 4 + 07(0)) 

+ 07(0 + 24 + 0(7(0) + 7(0 + 4 + 0:7(0)))) 



Continuing in this manner, the gth iterate has the form 



f^{0, 4,a) = 0 + q(p + a j{0) + 7(0 + 4> + aj(9)) + j (0 + 24 > + a(j(9) 



+ 7(0 + 4 > + 07(0)))) 

+ 7 (0 + 3(/) + 0(7(0) + 7(0 + 4 + 07(0)) + 7(0 + 2 , 4 ) + 0(7(0) 
+ 7(0 + (/)+ 07(0)))))) H 
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+ 7 (0 + ((? — !)</!) + a{l{0) + 7(0 + (/» + oi"i{9)) + • • • 
+ 7 (^ + (<7 — 2 )(/) + a{j{6) + • • • + 7(0 + ((? — 3)(/) 

+ « 7 (" •))))))! 



Since 7 is C^, we can Taylor expand a general term of the form 
7(0 + k(j> + aG{6, (f>, a)) = 7 ( 6 * + k<j>) + </>, a). 

Using this in the expansion for gives 



q-l 



4>,a) = 9 + qcj) + a ^ 7(0 + j</>) + ah{9, <f>, a) 
i=o 



with h € G^ and ft, — >■ 0 as a — 1 0 . 



□ 



Definition 21.6.18 For /3 G K we define the set 

Af} = {{(j),a)\(j) eR,a G [0, 1), p(^, a) = P}, 

(note that t = P)- When P = ^> Ap is called an Arnold Tongue. 

If the rotation number is rational the map is said to he phase-locked or 
mode-locked. 



Theorem 21.6.19 (Existence of Arnold Tongues) For each rational 
I there exists Lipschitz functions pi, p 2 - [0, 1) — f ® such that 



1. y a G [0,e), pi{a) < p 2 {oi), 

2. pm = P 2 P)) = 

3 . (p,a) G Ap if and only if pi (a) < p < p 2 {ce). 



The Arnold tongue is illustrated in Figure 21.6.1. 
Proof: Consider the equation 

G{9,p,a) = r{9,p,a)-9-p=0. 



For fixed p and a, a solution 9 of this equation corresponds to a periodic 
point of period |. Now for a = 0, (/) = | is a solution, for all 9. We will use 
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the implicit function theorem to “continue” this solution for a > 0. To do 
this, we must compute the following derivative 



d(j) 



{9,(1), a) 



d(j) 



{9, (/>, a) 



= q + a 



'q-l 



■j(l)) 



\j=o 



ah' {9, <j), a), 




FIGURE 21.6.1. The Arnold tongue Ap. 

q 




FIGURE 21.6.2. Graph of the function <j}{6,a) for some hxed 9. 

which is clearly nonzero at a = 0, </> = 9 G [0, 1). Moreover, since jy'j < 1, 

\h'\ < a, it is also nonzero for O<a<l,0G[O,l). Then by the implicit 
function theorem there exists a unique function (j>{d, a) with 0 G [0, 1), 
a G [0, 1) such that 
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G{9, (j){e, a), a) = ^(6», a),a) - 9 - p=Q, 



with </>(6*,0) = see Figure 21.6.2. 

We now take 

4>i{a) = inf 4>{9,a), 4>2{cx) = sup (p(9,a), 

»6[o,i) ee[o,i) 



where </>i(0) = ^ 2 ( 0 ) = For fixed a, the values of (f) satisfying <t)i{a) < 
4> ^ 4>2{ci) correspond to parameter values for which a) = ^. 

Now we argue that the curve generically opens at ^ as a increases, i.e, 
as a increases from zero, 4>i{a) < 4>2{a). Implicitly differentiating the ex- 
pression 

P{6, a), a) - ^ - p = 0, 



gives 



or 



^ ^ 

da d(j) da da 



d(j) 

da 



dfl 

doL 

df^ 

d(j) 



Using the expression for given in Lemma 21.6.17, we obtain 

9-1 



d(j) 

da 



a=0 



= --E 7 

^ j=o 



' + J- 

q 



From our assumptions, j{9) has zero average. It then follows that the av- 
erage of f^|a=o with respect to 9 is zero. Therefore ^{9,0) takes on both 
positive and negative values. Since generically |^(0,O) is not identically 
zero, we have 4>i{a) < ^ 2 ( 0 ), at least for a near zero. □ 



The Opening of the Arnold Tongues 

We have shown that generically the Arnold tongues open as a increases 
from zero. In our figures we have illustrated the opening getting wider and 
wider as a increases to one. However, we stress that this is really just artistic 
license; we have not proved that the curves behave in this way. McGehee 
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and Peckham [1995] numerically compute “global” Arnold tongues for a 
number of examples. Their paper also has a nice literature survey on the 
subject. 



Nonresonance 

The Arnold tongues characterize the region in a — ^ for which the rotation 
number is rational. Herman [1979] has proved a theorem which character- 
izes irrational rotation numbers. 

Theorem 21.6.20 (Herman) For a given irrational number rj there ex- 
ists a Lipschitz curve ipr^ : [0, 1) — > K such that p{4>, a) = rj if and only if 

(t> = f/'r,(a)- 

In other words, given an irrational rotation number, it must lie on a 
Lipschitz curve corresponding to parameter values which give the same 
irrational rotation number, see Figure 21.6.3. 




FIGURE 21.6.3. Illustration of an Arnold tongue and a curve <j> = cor- 

responding to parameter values for which the rotation number is the irrational 
number g. 



The Devils Staircase 

Suppose we fix a and graph p{(p, a) as a function of (j). The rotation number 
p{(j),a) is a continuous function of <j) and a (Theorem 21.6.12) and it is 
an increasing function of 4 > (Theorem 21.6.16). The graph is constant on 
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intervals corresponding to rational rotation numbers, and increasing at the 
points corresponding to irrational rotation numbers. The graph appears as 
in Figure 21.6.4 and is referred to as a Devils staircase. 



The Lebesque Measure of the Set of Parameters 
Corresponding to Rational and Irrational Rotation Numbers 

Consider the rotation number p(a,(f>) for some fixed 0 < a < 1. For a = 
0 the rotation number is rational for all rational values of </>, and it is 
irrational otherwise. Hence the Lebesgue measure of (p values corresponding 
to rational rotation numbers is zero and the Lebesgue measure of (p values 
corresponding to irrational rotation numbers is one. 

P (<|), a) 



FIGURE 21.6.4. The devils staircase. The graph of p{a, <p) as a function of p, for 
some fixed 0 < a < 1. 

As a increases from zero (but still remains “small”), one obtains a set of 
<p values having positive measure corresponding to rational rotation num- 
bers. However, the Lebesgue measure of this set is 0{a), and most (in the 
sense of Lebesgue measure) values of p correspond to irrational rotation 
numbers. The set of p values corresponding to irrational rotation numbers 
is a Cantor set; it contains no intervals. From the point of view of applica- 
tions this seems like a strange situation, since for a small, a p value chose 
at random will most likely give rise to an irational rotation number, but 
it can be converted to a rational rotation number by an arbitrarily small 
perturbation. 

Rigorous results on the Lebesgue measure of rotation numbers can be 
found in Herman [1977]. 
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Phase Locking in a Continuous Time Setting 

We remark that phase locking for ordinary differential equations has pre- 
viously been studied by Loud [1967] and Bushard [1972], [1973]. 



21.6b Exercises 

1. Prove Theorem 21.6.13. 

2. Prove that if the rotation number of an orientation preserving homeomorphism of the 
circle is rational, then it has a periodic orbit (thus completing the proof of Theorem 
21.6.14). 



3. Prove Theorem 21.6.15. 

4. Prove that the curves (pi{oi) and (p 2 {ct) constructed in Theorem 21.6.19 are Lipschitz 
in a. 

5. Prove that the width of the opening of Ap constructed in Theorem 21.6.19 is O(a^). 

«? 

6. Prove that parameter values on the two bounding curves of the Arnold tongue Ap (i.e., 

q 

values of 4> and a on the curves and <f> 2 {ct)) generically correspond to saddle- 

node bifurcations of ^ periodic orbits. Hint: apply the implicit function theorem to 
the function 




FIGURE 21.6.5. 



F{0,4>,a) 



<p,a) - e - p, -^(0, <p,a) - 1 

oO 



7. In this exercise we will compute a few of the Arnold tongues explicitly. The results 
should give some idea of how the neglected higher order terms in the normal form can 
affect the dynamics on the invariant circle arising in a Naimark— Sacker bifurcation. 

Consider the two-parameter family of maps 

X X fj, -\- e cos 27ra: = f{x, fi., e), £>0, (21.6.7) 

where we identify points in Ri that differ by an integer so that (21.6.7) can be regarded 
as a map defined on the circle — R^/Z. 
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a) Discuss the orbit structure of (21.6.7) for e = 0. In particular, what is the 
Lebesgue measure of the set of parameter values for which (21.6.7) has periodic 
orbits? 

b) Consider the following regions in the — s plane (see Figure 21.6.5). 



— 1 zt s, 



M ^ - ± £ - + 0{s ), 

1 2 V3 . 3 >/77T 4 

M = - + £ -—7z±e — hO(e ). 

3 6 6 



Show that, for parameter values in the interior of these regions, (21.6.7) has a 
period 1, period 2, and period 3 point, respectively. 

Hint: we outline the procedure for the period 2 points. 

i. If X is a period 2 point of (21.6.7), then 



f^(x, ^,e) — X — 1 — G{x, /i, e) — 0. 

ii. If ^ 0, then we have a function — ^{x, e) such that 



G{x, fi{x, s), s) — 0. 

iii. Expand the function fj.{x,e) as follows 

0 ^2 q2 

fi(x,e) = ^l{x, 0) + £^(a;, 0) + 0) + 0(e^ 

oe 2 

iv. Implicitly differentiating (21.6.8), show that 



(21.6.8) 



K^,o) = 



dfi 

ds 

ds^ 



(a^, 0) = 0, 

(a:, 0) = 27rsin47ra:. 



V. Taking the infimum and supremum in Step 4, we obtain 



fi ^ ^{x, e) ^ - ± £ - + 0(£^). 



Justify all steps completely. 

Now let us return to the setting of the Naimark— Sacker bifurcation. For £ = 0, 
(21.6.7) has the form of the truncated Naimark— Sacker normal form restricted 
to the invariant circle. The term £cos27ra: could be viewed as illustrating the 
possible effects of higher order terms in the normal form. For (21.6.7), at £ = 0 
the map has periodic orbits for all rational (i.e., a set of Lebesgue measure 
zero). For e small and fixed, our results show that the measure of the set of fi 
values for which (21.6.7) has a periodic orbit is positive. Thus, based on this 
example, we might expect the higher order terms in the Naimark— Sacker normal 
form to have a dramatic influence on the dynamics restricted to the bifurcated 
invariant circle. See looss [1979] for more details. 

8. For some fixed 0 < £ < 1, compute the devils staircase for the following map 
x X ^ e cos 27 T£c = /(a:, /i, £) , 



where a: G . 
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On the Interpretation and 
Application of Bifurcation 
Diagrams: A Word of Caution 



At this point, we have seen enough examples so that it should be clear 
that the term bifurcation refers to the phenomenon of a system exhibiting 
qualitatively new dynamical behavior as parameters are varied. However, 
the phrase “as parameters are varied” deserves careful consideration. Let 
us consider a highly idealized example. 



The situation we imagine is wind blowing through Venetian blinds hang- 
ing in an open window. The “parameter” in this system will be the wind 
speed. From experience, most people have observed that nothing much 
happens when the wind speed is low enough, but, for high enough wind 
speeds, the blinds begin to oscillate or “flutter.” Thus, at some critical pa- 
rameter value, a Poincare- Andronov-Hopf bifurcation occurs. However, we 
must be careful here. In all of our analyses thus far the parameters have 
been constant. Therefore, in order to apply the Poincare-Andronov-Hopf 
bifurcation theorem to this problem, the wind speed must be constant. At 
low constant speeds, the blinds lie flat; at constant speeds above a certain 
critical value, the blinds oscillate. The point is that we cannot think of the 
parameter as varying in time, e.g., wind speed increasing over time, even 
though this is what happens in practice. Dynamical systems having param- 
eters that change in time (no matter how slowly!) and that pass through 
bifurcation values often exhibit behavior that is very different from the 
analogous situation where the parameters are constant. Let us consider a 
more mathematical example due to Haberman [1979, further studied by 
Schecter [1985]], whose exposition we follow. 



Consider the vector held 
We suppose that 

/(o,m) = o 

so that a; = 0 is always a fixed point, and that 



( 22 . 0 . 1 ) 



( 22 . 0 . 2 ) 
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f{x,fi) = 0 (22.0.3) 

intersects a; = 0 at ^ & and appears as in Figure 22.0.1. 

X 




We further assume that 



dx 



(0,/t) 



is 



f < 0 for X < 6, 
( > 0 for X > 6, 



(22.0.4) 



so that the stability of the fixed points is as shown in Figure 22.0.1. Thus, 
(22.0.1) undergoes a transcritical bifurcation at /i = 6. Now, if we think 
of (22.0.1) as modelling a physical system, we would expect to observe the 
system in a stable equilibrium state. For fj, < b, this would be x = 0, and 
for fj, slightly larger than b, x small enough, this would be the upper branch 
of fixed points bifurcating from the transcritical bifurcation point. 



Let us consider the situation in which the parameter is allowed to drift 
slowly in time as follows 

x = fix,^^), (22.0.5) 

/i = £, (22.0.6) 

where s is viewed as small and positive (so that trajectories always move 
toward the right). 

(Note: Schecter [1985] considers a much more general situation where 
/i may depend on x and fx.) Now let us consider the fate of an initial 
condition (fl,x) with p, < b and x > 0 sufficiently small; see Figure 22.0.2. 
This point is attracted strongly toward x = 0 (but it can never cross x = 0. 
Why?) and drifts slowly toward the right. Schecter proves that, on passing 
through /i = 6, rather than being repelled from x = 0 as would be the 
case for £ = 0, the trajectory follows closely x = 0 (which is an unstable 
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invariant manifold for fx > b) for awhile before ultimately being repelled 
away; see Figure 22.0.2. 



X 




ft 



FIGURE 22.0.2. 

Thus, what one would observe for e = 0 differs from that for £ yf 0; for 
£ yf 0, certain trajectories tend to remain in the neighborhood (for awhile) 
of what are unstable fixed points for £ = 0. 

A detailed analysis of problems of this type is beyond the scope of this 
book (such problems fit very nicely into the context of singular perturba- 
tion theory). The point we wish to make is that, within a given system, 
the behavior of that system on either side of a bifurcation value may be 
very different in cases when the parameter varies slowly in time (no matter 
how slowly) through the bifurcation value, as opposed to cases when the 
parameter is constant. “Slowly varying” saddle-node, transcritical, pitch- 
fork, and Hopf bifurcations for vector fields have all been considered, as 
well as their Hamiltonian analogs. The reader will find detailed analyses 
of such problems in Mitropol’skii [1965], Lebovitz and Schaar [1975, 1977], 
Haberman [1979], Neishtadt [1987], [1988], Baer et al. [1989], Erneux and 
Mandel [1986], Lebovitz and Pesci [1995], and Raman and Bajaj [1998] . 
The recent review of Arnold et al. [1994] provides an overview of many 
of the latest results in this area. Baesens [1991], [1995] considers similar 
phenomena in maps. 
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The Smale Horseshoe 



We will begin our study of “chaotic dynamics” by describing and analyzing 
a two-dimensional map possessing an invariant set having a delightfully 
complicated structure. The discussion is virtually the same as the discussion 
in Wiggins [1988]. Our map is a simplified version of a map first studied 
by Smale [1963], [1980] and, due to the shape of the image of the domain 
of the map, is called a Smale horseshoe. 

We will see that the Smale horseshoe is the prototypical map possess- 
ing a chaotic invariant set (note: the phrase “chaotic invariant set” will 
be precisely defined later on in the discussion). Therefore, we feel that a 
thorough understanding of the Smale horseshoe is absolutely essential for 
understanding what is meant by the term “chaos” as it is applied to the 
dynamics of specific physical systems. For this reason we will first endeavor 
to define as simple a two-dimensional map as possible that still contains 
the necessary ingredients for possessing a complicated and chaotic dynam- 
ical structure so that the reader may get a feel for what is going on in the 
map with a minimum of distractions. As a result, our construction may not 
appeal to those interested in applications, since it may appear rather arti- 
ficial. However, following our discussion of the simplified Smale horseshoe 
map, we will give sufficient conditions for the existence of Smale horseshoe- 
like dynamics in two-dimensional maps that are of a very general nature. 
We will begin by defining the map and then proceed to a geometrical con- 
struction of the invariant set of the map. We will utilize the nature of the 
geometrical construction in such a way as to motivate a description of the 
dynamics of the map on its invariant set by symbolic dynamics, following 
which we will make precise the idea of chaotic dynamics. 



23.1 Definition of the Smale Horseshoe Map 



We will give a combination geometrical-analytical definition of the map. 
Consider a map, /, from the square having sides of unit length into 
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£>= {(a;, 2 /) GK^|0<x< l,0<y< 1}, (23.1.1) 

which contracts the a;-direction, expands the y-direction, and folds D 
around, laying it back on itself as shown in Figure 23.1.1. 

Contraction and 



y expansion 




FIGURE 23.1.1. 

We will assume that / acts affinely on the “horizontal” rectangles 

Hq = {{x,y) G |0 < a; < 1,0 < y < 1/y}, (23.1.2) 

and 

Hi = {(x,y) G |0 < cc < 1,1 - l//r < y < 1}, (23.1.3) 

taking them to the “vertical” rectangles 

/(iJo) = Fo = {(x,y) G I 0 < X < A,0 < y < 1}, (23.1.4) 

and 

f{Hi) = Vi = {(x,y) G M" 1 1 - A < X < 1,0 < y < 1}, (23.1.5) 
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and with 0<A<l/2, /x>2 (note: the fact that, on Hi, the matrix 
elements are negative means that, in addition to being contracted in the x- 
direction by a factor A and expanded in the y-direction by a factor y,. Hi is 
also rotated 180°). Additionally, it follows that f~^ acts on H as shown in 
Figure 23.1.2, taking the “vertical” rectangles Vq and Vi to the “horizontal” 
rectangles Hq and Hi, respectively (note: by “vertical rectangle” we will 
mean a rectangle in D whose sides parallel to the y axis each have length 
one, and by “horizontal rectangle” we will mean a rectangle in D whose 
sides parallel to the x axis each have length one). This serves to define /; 
however, before proceeding to study the dynamics of / on D, there is a 
consequence of the definition of / which we want to single out, since it will 
be very important later. 

y 

X 




FIGURE 23.1.2. 



Lemma 23.1.1 a) Suppose V is a vertical rectangle; then f{V)C\D consists 
of precisely two vertical rectangles, one in Vq and one in Vi, with their 
widths each being equal to a factor of A times the width of V. b) Suppose 
H is a horizontal rectangle; then f~^{H) fl D consists of precisely two 
horizontal rectangles, one in Hq and one in Hi, with their widths being a 
factor ofl/y times the width of H . 



Proof: We will prove Case a). Note that from the definition of /, the hori- 
zontal and vertical boundaries of Hq and Hi are mapped to the horizontal 
and vertical boundaries of Vq and Vi, respectively. Let U be a vertical 
rectangle; then V intersects the horizontal boundaries of Hq and Hi, and 
hence, f{V) fl D consists of two vertical rectangles, one in Vq and one in 
Vi- The contraction of the width follows from the form of / on Hq and Hi, 
which indicates that the x-direction is contracted uniformly by a factor A 
on Hq and Hi. Case b) is proved similarly. See Figure 23.1.3. □ 



We make the following remarks concerning this lemma. 







558 



23. The Smale Horseshoe 



Remark 1. The qualitative features of Lemma 23.1.1 are independent of 
the particular analytical form for / given in (23.1.6); rather, they are more 
geometrical in nature. This will be important in generalizing the results of 
this section to arbitrary maps. 



Remark 2. Lemma 23.1.1 is concerned only with the behavior of / and 
However, we will see in the construction of the invariant set that the 
behavior described in Lemma 23.1.1 allows us to understand the behavior 
of /" for all n. 

We now turn to the construction of the invariant set for /. 



J 



















V 



(a) 

























s^SS' 










(b) 

FIGURE 23.1.3. 




23.2 Construction of the Invariant Set 

We now will geometrically construct the set of points. A, which remain in 
D under all possible iterations by /; thus A is defined as 

• • • n /■”(£>) n • • • n r^{D) non f{D) n • • • n /”(£») n • • • 



or 



n 



We will construct this set inductively, and it will be convenient to construct 
separately the “halves” of A corresponding to the positive iterates and 
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the negative iterates and then take their intersections to obtain A. Before 
proceeding with the construction, we need some notation in order to keep 
track of the iterates of / at each step of the inductive process. Let S = {0, 1} 
be an index set, and let Si denote one of the two elements of S, i.e., Si G S, 
i = 0, ±1, ±2, • • • (note: the reason for this notation will become apparent 
later on). 

We will construct constructing rin=o/"(-^) 

determining the nature of the limit as fc — >■ oo. 



Dr\f{D). By the definition of f, D D f{D) consists of the two vertical 
rectangles Vq and Vi, which we denote as follows 

Dn.f{D)= U Vs_, = {p€D\pGVs_„ s.iGS}, (23.2.1) 

s_ i^S 



where is a vertical rectangle of width A; see Figure 23.2.1. 





FIGURE 23.2.1. 



D n f{D) n P{D). It is easy to see that this set is obtained by acting on 
Dr\f{D) with / and taking the intersection with D, since Dr\f{Dr\f{D)) = 
D n f{D) n P{D). Thus, by Lemma 23.1.1, since D fl f{D) consists of the 
vertical rectangles Vq and V\ with each intersecting Hq and Hi and their 
respective horizontal boundaries in two components, then D D f (D) D (D) 
corresponds to four vertical rectangles, two each in Vq and Vi, with each 
of width A^. Let us write this out more explicitly. Using (23.2.1) we have 

Dnf{D)nf{D) = Dnf{Dnf{D)) = Dnf( |J Vs_X ( 23 . 2 . 2 ) 

^S-2GS ^ 
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where, in substituting (23.2.1) into (23.2.2), we have changed the subscript 
s_i on Vs_^ to Vs _2 - As we will see, this is a notational convenience which 
will be a counting aid. It should be clear that this causes no problems, since 
S-i is merely a dummy variable. Using a few set-theoretic manipulations, 
(23.2.2) becomes 

Dful U Vs_,)= U DDf{Vs_,). (23.2.3) 

'^s_2GS ^ S_2GS 



Now, from Lemma 23 . 1 . 1 , cannot intersect all of D but only VqUVi, 

so ( 23 . 2 . 3 ) becomes 

U ^n/(K_J= U U,_,n/(U,_J. ( 23 . 2 . 4 ) 

S — 2^S S — i^S 

2 = 1,2 



Putting this all together, we have shown that 
Dnf{D)nf{D) 

= U m-.)nK_J= U 

S-i^S s_i^S 

2 = 1, 2 2 = 1,2 

= {p€D\p€Vs_,J-\p) G v;_ 2 ,s-. G 5,z= 1,2}. (23.2.5) 



Pictorially, this set is described in Figure 23.2.2. 




FIGURE 23.2.2. 



D n f{D) n P{D) n P{D). Using the same reasoning as in the previous 
steps, this set consists of eight vertical rectangles, each having width A^, 
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FIGURE 23.2.3. 



which we denote as follows 



Dn f{D)nf{D)nf{D) 

= U ifiVs.,s.,)nVs_,)= U 



s — i^S 
z=l,2,3 



s-iGS 

z=l,2,3 



= {p e D\p GVs-i,f \p)eVs_2, 
f-‘^{p) G U._ 3 ,s-i G S', i = 1,2,3}, 



(23.2.6) 



and is represented pictorially in Figure 23.2.2. 

If we continually repeat this procedure, we almost immediately encounter 
extreme difficulty in trying to represent this process pictorially, as in Fig- 
ures 23.2.1 through 23.2.3. However, using Lemma 23.1.1 and our labeling 
scheme developed above, it is not hard to see that at the step we obtain 

Dnf{D)n---nf{D) 

= U IJ 

s-iGS s-iGS 

i=l,2,...,k i=l,2,...,k 

= {peD\ n+\p) G G S, t = 1, • • • , fc} (23.2.7) 

and that this set consists of 2^ vertical rectangles, each of width A^. 

Before proceeding to discuss the limit as fc — >■ oo, we want to make the 
following important observation concerning the nature of this construction 
process. Note that at the stage, we obtain 2^ vertical rectangles, and 
that each vertical rectangle can be labeled by a sequence of O’s and I’s of 
length k. The important point to realize is that there are 2^ possible distinct 
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sequences of O’s and I’s having length k and that each of these is realized 
in our construction process; thus, the labeling of each vertical rectangle is 
unique at each step. This fact follows from the geometric definition of / 
and the fact that Vq and Vi are disjoint. 

Letting fc — >■ oo, since a decreasing intersection of compact sets is non- 
empty, it is clear that we obtain an infinite number of vertical rectangles 
and that the width of each of these rectangles is zero, since limfe_>oo = 0 
for 0 < A < 1/2. Thus, we have shown that 

OO 

n/”(^)= U 

n— 0 s_i^S 

2 = 1 , 2 ,... 

= U ^-1-. ••• 

s-iGS 
2 = 1,2,... 

= {peD\ n+Hp) G es,i = 1,2,- 

(23.2.8) 



consists of an infinite number of vertical lines and that each line can be 
labeled by a unique infinite sequence of O’s and I’s (note: we will give a 
more detailed set-theoretic description of n()Tg/”(I?) later on). 

Next we will construct f"{^) inductively. 



D D f ^{D). From the definition of /, this set consists of the two horizontal 
rectangles Hq and Hi and is denoted as follows 

Dnf-\D)= \J Hso = {p€D\p€Hs„,So€S}. (23.2.9) 

Sq^S 

See Figure 23.2.4. 




FIGURE 23.2.4. 
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FIGURE 23.2.5. 

D n f~^{D) n We obtain this set from the previously constructed 

set, D n f~^{D), by acting on H fl f~^{D) with f~^ and taking the inter- 
section with D, since D C\ f~^ (^D C\ f~^ (D)) = Dn/“^(I?) n/“^(£>). Also, 
by Lemma 23.1.1, since Hq intersects both vertical boundaries of Vq and 
Vi, as does Hi, Dr\f~^{D)(lf~‘^{D) consists of four horizontal rectangles, 
each of width l//i^. Let us write this out more explicitly. Using (23.2.9), 
we have 

D n r^{D n r^{D)) = Dn [ |J h,, 

= IJ Dnf-\H,,), (23.2.10) 

si^S 



where in substituting (23.2.9) into (23.2.10) we have changed the subscript 
So on Hgg to si. This has no real effect, since Sj is simply a dummy variable. 
The reason for doing so is that it will provide a useful counting aid. 

From Lemma 23.1.1, it follows that /“^(iL^i) cannot intersect all of D, 
only HqU Hi, so that (4.1.14) becomes 

\jDnr\Hs,)^ IJ Hs,nrHHs,). (23.2.11) 

Sx^S Si^ S 

z=0,l 

Putting everything together, we have shown that 

Dnr\D)nrHD) 

= U {.rHHsjnH,,)= y 

Si^S Si^S 

z=0,l z=0,l 

= {p€D\p€Hs„ f{p) €Hs„Si€S,i = 0,l}. (23.2.12) 



See Figure 23.2.5. 
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D n f~^{D) n f~^{D) n f~^{D). Using the same arguments as those given 
in the previous steps, it is not hard to see that this set consists of eight 
horizontal rectangles each having width l//i^ and that it can be denoted 
as 



D n /-‘(D) n /-2(D) n /-3(D) 

= U (r‘(D,.„)nH..)s u HsoSiS2 

SiGS SiGS 

i=0,l,2 i=0,l,2 

= {p€ D\p€ Hso,f{p) e Hs^, 

f{p) = 0, 1, 2}. (23.2.13) 



See Figure 23.2.6. 

Continuing this procedure, at the step we obtain D iT f~^{D) iT • • • fl 
which consists of 2^ horizontal rectangles each having width l//x*. 
This set is denoted by 



Hioo ^^101 




Si^S Si^S 

= {p(^D\f{p) G e S,i = 0,---,k~l}. (23.2.14) 

As in the case of vertical rectangles, we note the important fact that at 
the step of the inductive process, each one of the 2^ vertical rectangles 
can be labeled uniquely with a sequence of O’s and I’s of length k. Now, as 
we take the limit as fc — >■ oo, we arrive at /”(^)> which is an infinite 
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set of horizontal lines, since a decreasing intersection of compact sets is 
nonempty and the width of each component of the intersection is given 
by limfe_>oo(l//t*) = 0, /t > 2. Each line is labeled by a unique infinite 
sequence of O’s and I’s as follows 



n—Q 



f|.rp)= u U 

-OO SiGS SiGS 

2 = 0,1, ••• 2 = 0, 1,--- 

= {p(^D\ Pip) eHs,,Si€S,i = 0P,---}. (23.2.15) 



Thus, we have 

OO 

A = n r{D) = I f| r{D) I n I f| r{D) 



'~n——oo 



^n—0 



(23.2.16) 



which consists of an infinite set of points, since each vertical line in 
n^o/"(A*) intersects each horizontal line in f'^(D) in a unique 

point. Furthermore, each point p € A can be labeled uniquely by a bi- 
infinite sequence of O’s and I’s which is obtained by concatenating the 
sequences associated with the respective vertical and horizontal lines that 
serve to define p. Stated more precisely, let s_i • • • S-k • • • be a particular 
infinite sequence of O’s and I’s; then corresponds to a unique 

vertical line. Let sq ’ ’ ’ Sfc • • ■ likewise be a particular infinite sequence of 
O’s and I’s; then corresponds to a unique horizontal line. Now a 

horizontal line and vertical line intersect in a unique point p; thus, we have 
a well-defined map from points p G A to bi-infinite sequences of O’s and I’s 
which we call 4>. 

4> ^ 

p I >■ • • • S-k ■ ■ ■ S-lSo • • • Sfc • • • . 



Notice that because 

= {p&D\ r*+i(p) G z = 1, . . .} 

= {pG D\f~\p) G Hs_„i= I,---} 

since f{Hs,) = Vs, (23.2.17) 



and 

Hs„...s,... = {p G 7? I r(p) €Hs„i = 0,-- •}, (23.2.18) 



we have 

P = bs_i. bl Hsg...S,---- 
= {peD\ f{p)eHs„i = 0,±i, ±2,- 



(23.2.19) 
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Therefore, we see that the unique sequence of O’s and I’s we have associated 
with p contains information concerning the behavior of p under iteration 
by /. In particular, the s^th element in the sequence associated with p 
indicates that f^{p) G - Now, note that for the bi-infinite sequence of 
O’s and I’s associated with p, the decimal point separates the past iterates 
from the future iterates; thus, the sequence of O’s and I’s associated with 
f^{p) is obtained from the sequence associated with p merely by shifting 
the decimal point in the sequence associated with p k places to the right if 
k is positive or k places to the left if k is negative, until is the symbol 
immediately to the right of the decimal point. We can define a map of 
bi-infinite sequences of O’s and I’s, called the shift map, u, which takes a 
sequence and shifts the decimal point one place to the right. Therefore, 
if we consider a point p G A and its associated bi-infinite sequence of O’s 
and I’s, we can take any iterate of p, f^{p), and we can immediately 
obtain its associated bi-infinite sequence of O’s and I’s given by a^{(j){p)). 
Hence, there is a direct relationship between iterating any point p G A 
under / and iterating the sequence of O’s and I’s associated with p under 
the shift map a. 

Now, at this point, it is not clear where we are going with this analogy 
between points in A and bi-infinite sequences of O’s and I’s since, although 
the sequence associated with a given point p G A contains information on 
the entire future and past as to whether or not it is in Hq ov H\ for any 
given iterate, it is not hard to imagine different points, both contained in 
the same horizontal rectangle after any given iteration, whose orbits are 
completely different. The fact that this cannot happen for our map and 
that the dynamics of / on A are completely modeled by the dynamics of 
the shift map acting on sequences of O’s and I’s is an amazing fact which, 
to justify, we must digress into symbolic dynamics. 



23.3 Symbolic Dynamics 

Let S = {0, 1} be the set of nonnegative integers consisting of 0 and 1. Let 
S be the collection of all bi-infinite sequences of elements of S, i.e., s G T, 
implies 

s = {• • • S_n • • • S_i.So ■ ■ ■ Sn ■ ■ ■}’ Si G S V Z. 

We will refer to S as the space of bi-infinite sequences of two symbols. We 
wish to introduce some structure on S in the form of a metric, d{-, •), which 
we do as follows. Consider 



S — {***S_^***S_i.Sq*** Sn ‘ * *b 
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S — {' * * ^—n * * ‘ l-^O ‘ ‘ ‘ * * *} € ^5 



we define the distance between s and s, denoted d{s, s), as follows 



d{s,s) = 



2hi 



where <5,- = 



0 if Sj — Sj, 

1 if Si yf Sj. 



(23.3.1) 



Thus, two sequences are “close” if they agree on a long central block. (Note: 
the reader should check that d{-,-) does indeed satisfy the properties of a 
metric. See Devaney [1986] for a proof.) 

We consider a map of S into itself, which we shall call the shift map, a, 
defined as follows: For s = {• • • s_„ • • • s_i.soSi • • • s„ • • •} G S, we define 

= {■ ■ ■ •S-n ■ ■ ■ S-ISq.Si • • • s„ • • •} 



or a{s)i = Si+i- Also, cr is continuous; we give a proof of this later in 
Section 24. Next, we want to consider the dynamics of ct on S (note: for 
our purposes the phrase “dynamics of a on S” refers to the orbits of points 
in S under iteration by cr). It should be clear that a has precisely two 
fixed points, namely, the sequence whose elements are all zeros and the 
sequence whose elements are all ones (notation: bi-infinite sequences which 
periodically repeat after some fixed length will be denoted by the finite 
length sequence with an overbar, e.g., {• • • 101010.101010 • • •} is denoted 
by {TOTb}). 

In particular, it is easy to see that the orbits of sequences which period- 
ically repeat are periodic under iteration by a. For example, consider the 
sequence {10.10}. We have 



crjlO.lO} = {01.01}, 



and 

cr{oTTo} = {Toro}; 

thus, 

cr2{T0d0} = {Toro}. 



Therefore, the orbit of {10.10} is an orbit of period two for a. So, from 
this particular example, it is easy to see that for any fixed k, the orbits 
of a having period k correspond to the orbits of sequences made up of 
periodically repeating blocks of O’s and I’s with the blocks having length 
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k. Thus, since for any fixed k the number of sequences having a periodically 
repeating block of length k is finite, we see that a has a countable infinity 
of periodic orbits having all possible periods. We list the first few below. 

Period 1 : {0.0}, {1.1} 

Period 2 : {OLOT} ^ {TOTO} ^ {OTOT} 

Period 3 : {001.001} ^ {010.010} ^ {100.100} ^ {001.001} 

: { 110 . 110 } ^ { 101 . 101 } ^ { 011 . 011 } ^ { 110 . 110 } 

etc. 

Also, a has an uncountable number of nonperiodic orbits. To show this, 
we need only construct a nonperiodic sequence and show that there are an 
uncountable number of such sequences. A proof of this fact goes as follows: 
we can easily associate an infinite sequence of O’s and I’s with a given 
bi-infinite sequence by the following rule 

• • • S-n ■ ■ ■ S-I-So ■ ■ ■ Sn‘ ■ ■ -SoSlS-iS2S-2 ‘ ‘ ‘ ■ 

Now, we will take it as a known fact that the irrational numbers in the 
closed unit interval [0, 1] constitute an uncountable set, and that every 
number in this interval can be expressed in base 2 as a binary expansion 
of O’s and I’s with the irrational numbers corresponding to nonrepeating 
sequences. Thus, we have a one-to-one correspondence between an uncount- 
able set of points and nonrepeating sequences of I’s and O’s. As a result, 
the orbits of these sequences are the nonperiodic orbits of cr, and there are 
an uncountable number of such orbits. 

Another interesting fact concerning the dynamics of cr on S is that there 
exists an element, say s G S, whose orbit is dense in S, i.e., for any given 
s' G S and £ > 0, there exists some integer n such that d(cr”(s), s') < 
£. This is easiest to see by constructing s directly. We do this by first 

constructing all possible sequences of O’s and I’s having length 1, 2, 3, 

This process is well defined in a set-theoretic sense, since there are only 
a finite number of possibilities at each step (more specifically, there are 
2^ distinct sequences of O’s and I’s of length k). The first few of these 
sequences would be as follows 

length 1 : {0},{!} 

length 2 : {00}, {01}, {10}, {11} 

length 3 : {000}, {001}, {010}, {011}, {100}, {101}, {110}, {111} 

etc. 




23.3 Symbolic Dynamics 569 



We can now introduce an ordering on the collection of sequences of O’s and 
I’s in order to keep track of the different sequences in the following way. 
Consider two finite sequences of O’s and I’s 

S * * * S/i;} , S }. 

We can then say 

s < s if k <k' . 

lik = k' , then 

s <s if Si < Si, 

where i is the first integer such that Si yf Si. For example, using this 
ordering we have 

{ 0 } < {!}, 

{ 0 } < { 00 }, 

{00} < {01}, etc. 



This ordering gives us a systematic way of distinguishing different sequences 
that have the same length. Thus, we will denote the sequences of O’s and 
I’s having length k as follows 



s’l < ■■■ < S^k, 



where the superscript refers to the length of the sequence and the subscript 
refers to a particular sequence of length k which is uniquely specified by 
the above ordering scheme. This will give us a systematic way of writing 
down our candidate for a dense orbit. 



Now consider the following sequence 



s = {• • • 



Thus, s contains all possible sequences of O’s and I’s of any fixed length. 
To show that the orbit of s is dense in S, we argue as follows: let s' be 
an arbitrary point in S and let £ > 0 be given. An £-neighborhood of s' 
consists of all points s" G S such that d{s', s") < e, where d is the metric 
given in (23.3.1). Therefore, by definition of the metric on S, there must 
be some integer N = N{e) such that s' = s", |i| < N (note: a proof of this 
statement can be found in Devaney [1986] or in Chapter 24). By construc- 
tion, the finite sequence {s'_jy • • • s'_ 2 .Sq • • • s^y} is contained somewhere in 
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s; therefore, there must be some integer N such that d(^a^{s), s') < e, and 
we can then conclude that the orbit of s is dense in S. 

We summarize these facts concerning the dynamics of ct on S in the 
following theorem. 

Theorem 23.3.1 The shift map a acting on the space of bi-infinite se- 
quences of 0 ’s and 1 ’s, S, has 



i) a countable infinity of periodic orbits of arbitrarily high period; 

ii) an uncountable infinity of nonperiodic orbits; 

iii) a dense orbit. 



23.4 The Dynamics on the Invariant Set 



At this point we want to relate the dynamics of cr on S, about which we have 
a great deal of information, to the dynamics of the Smale horseshoe / on 
its invariant set A, about which we know little except for its complicated 
geometric structure. Recall that we have shown the existence of a well- 
defined map 4> which associates to each point, p G A, a, bi-infinite sequence 
of O’s and I’s, (f{p). Furthermore, we noted that the sequence associated 
with any iterate of p, say f^{p), can be found merely by shifting the decimal 
point in the sequence associated with p k places to the right if k is positive 
or k places to the left if k is negative. In particular, the relation a o <f>(p) = 
4> o /(p) holds for every p G A. Now, if (j) were invertible and continuous 
(continuity is necessary since / is continuous), the following relationship 
would hold 

(j>~^ o a o (j>(p) = f(p) VpGA. (23.4.1) 

Thus, if the orbit p G A under / is denoted by 

{• ■ ■ /"”(D, • • • , f~\p),P, f{p), ■■■, rip),- ■ ■}, (23.4.2) 

then, since o cr o ^(p) = /(p), we see that 

rip) = o cr o (/)) o o a o (jfj o ■ ■ ■ o [4>~^ o a o 4>ip)) 

= (j)~^ o o (j){p) , n > 0. (23.4.3) 



Also, from (23.4.1) we have 
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/ = ^o^(p) ypeA, 

from which we see that 

o o o o a~^ o o • • • o ((/)“^ o a~^ o ^(p)) 

= o cr“” o (/)(p), n > 0. (23.4.4) 



Therefore, using (23.4.2), (23.4.3), and (23.4.4), we see that the orbit of 
p G A under / would correspond directly to the orbit of 4>{p) under a in 
S. In particular, the entire orbit structure of cr on S would be identical to 
the structure of / on A. Hence, in order to verify that this situation holds, 
we need to show that </> is a homeomorphism of A and S. 



Theorem 23.4.1 The map </> : A — >■ S is o homeomorphism. 



Proof: We need only show that </> is one-to-one and continuous, since con- 
tinuity of the inverse will follow from the fact that one-to-one, onto, and 
continuous maps from compact sets into Hausdorff spaces are homeomor- 
phisms (see Dugundji [1966]). We prove each condition separately. 

(f is one-to-one: This means that given p,p' G A, if p yf p', then <f>{p) yf 
We give a proof by contradiction. Suppose p ^ p' and 

(j>{p) = (j){p') = |. . . . . .}. 



Then, by construction of A, p and p' lie in the intersection of the vertical 
line IA_i...s_„... and the horizontal line However, the intersection 

of a horizontal line and a vertical line consists of a unique point; therefore 
p = p' , contradicting our original assumption. This contradiction is due to 
the fact that we have assumed <j){p) = 4>{p')] thus, for p yf p', </)(p) yf <f){p'). 

(j) is onto: This means that given any bi-infinite sequence of O’s and I’s 
in S, say {• • • s_„ • • • s_i.so • • • Sn • • •}, there is a point p G A such that 

(j)[p) = {. . . . . . s_;i.So . . . . . .}. 

The proof goes as follows: Recall the construction of f"'{D) and 
ri-))o /”(^); given any infinite sequence of O’s and I’s, {• • • s_„ • • • S-i.}, 
there is a unique vertical line in corresponding to this 

sequence. Similarly, given any infinite sequence of O’s and I’s, {.sq 
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•••Sn"’’}) there is a unique horizontal line in correspond- 

ing to this sequence. Therefore, we see that for a given horizontal and 
vertical line, we can associate a unique bi-infinite sequence of O’s and I’s, 
{• • • s_„ • • • s_i.So • • • s„ • • •} and, since a horizontal and vertical line inter- 
sect in a unique point, p, to every bi-infinite sequence of O’s and I’s, there 
corresponds a unique point in A. 

4> is continuous: This means that, given any point p € A and £ > 0, we can 
find a (5 = 6{e,p) such that 

\p — p'\<S implies d[4>{p), 4>{p')) < s, 

where | ■ | is the usual distance measurement in and d(-, •) is the metric 
on S introduced earlier. 

Let £ > 0 be given; then, if we are to have d{(j){p) , cj){p')) < e, there must 
be some integer N = N{e) such that if 

4>{p) = 

^(p') = 

then Si = s'i, i = 0, ±1, . . . , ±fV. Thus, by construction of A, p and p' lie in 
the rectangle defined by see Figure 23.4.1. Recall that 

the width and height of this rectangle are and respectively. 

Thus we have |p — p'| < {X^ + Therefore, if we take S = X^ + 

continuity is proved. □ 




We make the following remarks. 

Remark 1. Recall from Chapter 12, that the dynamical systems / acting 
on A and cr acting on S are said to be topologically conjugate if ^ o /(p) = 
(TO(j){p). (Note: the equation (po f{p) = ao(j){p) is also expressed by saying 
that the following diagram “commutes.”) 

A -4 A 

(pi i (p 

S ^ S 

Remark 2. The fact that A and S are homeomorphic allows us to make 
several conclusions concerning the set-theoretic nature of A. We have al- 
ready shown that S is uncountable, and we state without proof that S is a 




23.5 Chaos 



573 







FIGURE 23.4.1. 

closed, perfect (meaning every point is a limit point), totally disconnected 
set and that these properties carry over to A via the homeomorphism (j). A 
set having these properties is called a Cantor set. We will give more detailed 
information concerning symbolic dynamics and Cantor sets in Chapter 24. 



Now we can state a theorem regarding the dynamics of / on A that is 
almost precisely the same as Theorem 23.3.1, which describes the dynamics 
of a on S. 

Theorem 23.4.2 The Smale horseshoe, f, has 

i) a countable infinity of periodic orbits of arbitrarily high period. These 
periodic orbits are all of saddle type; 

ii) an uncountable infinity of non-periodic orbits; 

iii) a dense orbit. 

Proof: This is an immediate consequence of the topological conjugacy of 
/ on A with a on S, except for the stability result. The stability result 
follows from the form of / on Hq and Hi given in (23.1.6). □ 



23.5 Chaos 

Now we can make precise the statement that the dynamics of / on A is 
chaotic. 
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Let p G A with corresponding symbol sequence 

(j){p) = {. . . . . . s_i.so • • • s„ • • •}. 



We want to consider points close to p and how they behave under itera- 
tion by / as compared with p. Let e > 0 be given; then we consider an 
e-neighborhood of p determined by the usual topology of the plane. Hence 
there also exists an integer N = N{e) such that the corresponding neighbor- 
hood of (j){p) includes the set of sequences s' = { • • • s'_„ • • • s'_;^ .Sq • • • s^ • • • } 
G S such that Si = s', |i| < N. Now suppose the iV-|-l entry in the sequence 
corresponding to 4>{p) is 0, and the A^-|- 1 entry in the sequence correspond- 
ing to some s' is 1. Thus, after N iterations, no matter how small e, the 
point p is in Hq; the point, say p', corresponding to s' under is in Hi, 
and they are at least a distance 1 — 2A apart. Therefore, for any point p G A, 
no matter how small a neighborhood of p we consider, there is at least one 
point in this neighborhood such that, after a finite number of iterations, p 
and this point have separated by some fixed distance. A system displaying 
such behavior is said to exhibit sensitive dependence on initial conditions. 
A dynamical system displaying sensitive dependence on initial conditions 
on a closed invariant set (which consists of more than one orbit) will be 
called chaotic. 



23.6 Final Remarks and Observations 



Now we want to end our discussion of this simplified version of the Smale 
horseshoe with some final observations. 




FIGURE 23.6.1. 
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1. If you consider carefully the main ingredients of / which led to The- 
orem 23.4.2, you will see that there are two key elements. 

(a) The square is contracted, expanded, and folded in such a way 
that we can find disjoint regions that are mapped over them- 
selves. 

(b) There exists “strong” stretching and contraction in complemen- 
tary directions. 

2. From observation 1), the fact that the image of the square appears in 
the shape of a horseshoe is not important. Other possible scenarios 
are shown in Figure 23.6.1. 

3. Notice that, in our study of the invariant set of /, we do not consider 
the question of the geometry of the points which escape from the 
square. We remark that this could be an interesting research topic, 
since this more global question may enable one to determine condi- 
tions under which the horseshoe becomes an attractor. 

4. Some interesting references related to the Smale horseshoe are Bowen 
[1975b], Young [1981], Hall [1994] and Kennedy and Yorke[ 2001]. 
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Symbolic Dynamics 



In the previous section we saw an example of a two-dimensional map which 
possessed an invariant Cantor set. The map, restricted to its invariant set, 
was shown to have a countable infinity of periodic orbits of all periods, an 
uncountable infinity of nonperiodic orbits, and a dense orbit. Now, in gen- 
eral, the determination of such detailed information concerning the orbit 
structure of a map is not possible. However, in our example we were able to 
show that the map restricted to its invariant set behaved the same as the 
shift map acting on the space of bi-infinite sequences of O’s and I’s (more 
precisely, these two dynamical systems were shown to be topologically con- 
jugate; thus their orbit structures are identical). The shift map was no less 
complicated than our original map but, due to its structure, many of the 
features concerning its dynamics (e.g., the nature and number of its peri- 
odic orbits) were more or less obvious. The technique of characterizing the 
orbit structure of a dynamical system via infinite sequences of “symbols” 
(in our case O’s and I’s) is known as symbolic dynamics. The technique is 
not new and appears to have originally been applied by Hadamard [1898] 
in the study of geodesics on surfaces of negative curvature and Birkhoff 
[1927], [1935] in his studies of dynamical systems. The first exposition of 
symbolic dynamics as an independent subject was given by Morse and Hed- 
lund [1938]. Applications of this idea to differential equations can be found 
in Levinson’s work on the forced van der Pol equation (Levinson [1949]), 
from which came Smale’s inspiration for his construction of the horseshoe 
map (Smale [1963], [1980]), and also in the work of Alekseev [1968], [1969], 
who gives a systematic account of the technique and applies it to problems 
arising from celestial mechanics. These references by no means represent a 
complete account of the history of symbolic dynamics or of its applications, 
and we refer the reader to the bibliographies of the above listed references 
or to Moser [1973] or Lind and Marcus [1995] for a more complete list of 
references on the subject and its applications. In recent times (say from 
about 1965 to the present) there has been a flood of applications of the 
technique. 

Symbolic dynamics will play a key role in explaining the dynamical phe- 
nomena we encounter in this chapter. For this reason, we now want to 
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describe some aspects of symbolic dynamics viewed as an independent sub- 
ject. Our discussion follows Wiggins [1988]. 

We let S = {1, 2, 3, • • • , A^}, iV > 2 be our collection of symbols. We 
will build our sequences from elements of S. Note that for the purpose of 
constructing sequences, the elements of S could be anything, e.g., letters 
of the alphabet, Chinese characters, etc. We will use positive integers since 
they are familiar, easy to write down, and we have as many of them as we 
desire. 



24.1 The Structure of the Space of Symbol 
Sequences 



We now want to construct the space of all symbol sequences, which we 
will refer to as from elements of S and derive some properties of 
It will be convenient to construct as a Cartesian product of infinitely 
many copies of S. This construction will allow us to make some conclusions 
concerning the properties of based only on our knowledge of S, the 
structure which we give to S, and topological theorems on infinite products. 

We now give some structure to S; specifically, we want to make S into a 
metric space, which can be done with the following metric 

r 1 if a^h, 

d{a,b)=\ (24.1.1) 

1 0 if a = b. 



It is trivial to check that d{-, •) is a metric. 

The metric (24.1.1) actually induces the discrete topology on S, i.e., 
the topology defined by the collection of all subsets of S, see Munkres 
[1975]. 

Since S consists of a finite number of points, it is trivial to verify that it is 
compact. Moreover, S is totally disconnected, i.e., its only connected sub- 
sets are one-point sets. We summarize the properties of S in the following 
proposition. 

Proposition 24.1.1 The set S equipped with the metric (24-Tl) is a com- 
pact, totally disconnected, metric space. 
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We remark that compact metric spaces are automatically complete met- 
ric spaces (see Munkres [1975], Section 7-3, Theorem 3.1). 

Now we will construct as a bi-infinite Cartesian product of copies of 
S': 

OO 

S^ = ---xSxSxSxSx---= S* where S* = S Vi. (24.1.2) 

i— — oo 



Thus, a point in is represented as a “bi-infinity-tuple” of elements of 
S: 

s G ^ s = {• • • , s_„, • • • , s_i. So, si, • • • 5 s„, • • •} where s^ G S Vi, 

or, more succintly, we will write s as 

s = {• • • s_„ • • • S-i.sqSi • • • s„ • • •} where Si G S Vi. 



A word should be said about the “decimal point” that appears in each 
symbol sequence and has the effect of separating the symbol sequence into 
two parts, with both parts being infinite (hence the reason for the phrase 
“bi-infinite sequence”). At present it does not play a major role in our 
discussion and could easily be left out with all of our results describing the 
structure of going through just the same. In some sense, it serves as a 
starting point for constructing the sequences by giving us a natural way of 
subscripting each element of a sequence. This notation will prove convenient 
shortly when we define a metric on However, the real significance of 
the decimal point will become apparent when we define and discuss the 
shift map acting on and its orbit structure. 

In order to discuss limit processes in , it will be convenient to define a 
metric on Since S' is a metric space, it is also possible to define a metric 
on There are many possible choices for a metric on however, we 
will utilize the following. For 

S = {• • • S-n ■ ■ ■ S-I.SqSI • • • Sn • • •}, 

S = {* * * 5 — n * ‘ ‘ ■ ‘ ‘ * * *} G FI , 



the distance between s and s is defined as 



d{s, s) = 



1 di{si,Si) 

^ 2l®l 1 -I- di(si. Si) ’ 



(24.1.3) 
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where di{-, •) is the metric on S'* = S' defined in (24.1.1). The reader should 
verify that (24.1.3) indeed defines a metric. Intuitively, this choice of metric 
implies that two symbol sequences are “close” if they agree on a long central 
block. The following lemma makes this precise. 

Lemma 24.1.2 For s,s€ 

i) Suppose d{s,s) < 1/(2^+^); then Si = Si for all |i| < M. 

ii) Suppose Si = Si for |z| < M; then d{s,s) < 1/(2^). 



Proof: The proof of i) is by contradiction. Suppose the hypothesis of i) 
holds and there exists some j with \j\ < M such that sj yf sj. Then there 
exists a term in the sum defining d(s, s) of the form 

^ dj {sj , Sj ) 

2bl l + dj{sj,Sj) 

However, since sj yf Sj, 

dj (sj , Sj ) _ 1 

l + dj(sj,Sj) 2 



and each term in the sum defining d(s, s) is positive so that we have 



,/ , 1 do (Sd, 54 ) 

d(s,s) > -r- ^ , 

2l-?l l + dj(Sj,Sy) 



1 ^ 1 
2b'l+i - 2^+1’ 



but this contradicts the hypothesis of i). 



We now prove ii). If Si = Si for |z| < M, we have 



d{s, s) 



i=-(M+l) 

w\ 



dii^Si , 5j) 

1 di(^Si^ Sij 



oo 

+ E 

i=M+l 



1 

W\ 



dii^Si^ sf) 

1 “t“ dii^Si^ S^) 



however, (^di{si, Si) /{I + di{si,Si))) = 1/2, so we obtain 

oo 1 ^ 

d{s,s)<2 ^ □ 

i=M+l 



Armed with our metric, we can define neighborhoods of points in 
and describe limit processes. Suppose we are given a point 
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s= G Si e S Vi, (24.1.4) 

and a positive real number e > 0, and we wish to describe the “e-neighbor- 
hood of s”, i.e., the set of s G such that d{s,s) < e. Then, by 
Lemma 24.1.2, given e > 0, we can find a positive integer M = M{e) 
such that d{s,s) < e implies Si = Si V|i| < M. Thus, our notation for an 
e-neighborhood of an arbitrary s G will be as follows 

AT^(^)(s) = {sGS^Is, = Si V|i| < M, s„Si G S' Vi}. 



Before stating our theorem concerning the structure of we need the 
following definition. 



Definition 24.1.3 A set is called perfect if it is closed and every point 
in the set is a limit point of the set. 



We are now ready to state our main theorem concerning the structure 
of 



Proposition 24.1.4 The space equipped with the metric (24-1.3) is 



i) compact, 

ii) totally disconnected, and 

iii) perfect. 



Proof: i) Since S is compact, is compact by Tychonov’s theorem 
(Munkres [1975], Section 5-1). 

ii) By Proposition 24.1.1, S is totally disconnected, and therefore 

is totally disconnected, since the product of totally disconnected spaces is 
likewise totally disconnected (Dugundji [1966]). 

iii) is closed, since it is a compact metric space. Let s G be an 
arbitrary point in then, to show that s is a limit point of we 
need only show that every neighborhood of s contains a point s yf s with 
s G Let A/’^^^^(s) be a neighborhood of s and let s = SM(e)+i + 1 if 
sm(s)+i 7^ and s = sm(s)+i — 1 if sm{s)+i = Then the sequence 

{■ • • S-M{e)-2SS-M(e) ' ' ’ S-I-SqSi ’ ’ ’ S M{e)SS M{s)+2 ‘ ‘ ’} 
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is contained in and is not equal to s; thus is perfect. □ 

We remark that the three properties of stated in Proposition 24.1.4 
are often taken as the defining properties of a Cantor set, of which the 
classical Cantor “middle-thirds” set is a prime example. 

The following theorem of Cantor gives us information concerning the 
cardinality of perfect sets. 



Theorem 24.1.5 Every perfect set in a complete space has at least the 
cardinality of the continuum. 

Proof: See Hausdorff [1957]. □ 

Hence, is uncountable. 



24.2 The Shift Map 



Now that we have established the structure of , we define a map on , 
denoted by cr, as follows. For s = {• • • s_„ • • • s_i.soSi • • • • • •} G we 

define 

(j(s) — {' * * ^—n * ‘ ‘ ■ ‘ ‘ ‘ ‘ *h 

or [cr(s)]i = Si+i. The map a is referred to as the shift map, and when the 
domain of a is taken to be all of it is often referred to as a full shift on 
N symbols. We have the following proposition concerning some properties 
of a. 



Proposition 24.2.1 i) ii) cr is continuous. 



Proof: The proof of i) is obvious. To prove ii) we must show that, given 
e > 0, there exists a 6{e) such that d{s,s) < S implies d(cr(s), cr(s)) < e for 
s,s G Suppose £ > 0 is given; then choose M such that 1/(2^“^) < e. 
If we then let <5 = 1/2^+^, we see by Lemma 24.1.2 that d{s, s) < 6 implies 
Si = Si for jzj < M; hence, [cr(s)]i = [cr(s)]i, jij < M — 1. Then, also by 
Lemma 24.1.2, we have d{a{.s),a{s)) < 1/2^“^ < e. □ 

We now want to consider the orbit structure of a acting on . We have 
the following proposition. 
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Proposition 24.2.2 The shift map a has 



i) a countable infinity of periodic orbits consisting of orbits of all periods; 

ii) an uncountable infinity of nonperiodic orbits; and 

iii) a dense orbit. 

Proof: i) This is proven in exactly the same way as the analogous result 
obtained in our discussion of the symbolic dynamics for the Smale horse- 
shoe map in Section 23.3. In particular, the orbits of the periodic symbol 
sequences are periodic, and there is a countable infinity of such sequences, 
ii) By Theorem 24.1.5 is uncountable; thus, removing the countable 
infinity of periodic symbol sequences leaves an uncountable number of non- 
periodic symbol sequences. Since the orbits of the nonperiodic sequences 
never repeat, this proves ii). iii) This is proven in exactly the same way as 
the analogous result obtained in our discussion of the Smale horseshoe map 
in Section 23; namely, we form a symbol sequence by stringing together all 
possible symbol sequences of any finite length. The orbit of this sequence 
is dense in since, by construction, some iterate of this symbol sequence 
will be arbitrarily close to any given symbol sequence in □ 



24.3 Exercises 



1. Compute the distinct period k orbits, fc — 1, 2, 3, 4, 5, for a : TV = 2, 3, 4, 5. 

2. Prove that a : has an infinite number of dense orbits. Are these dense 

orbits countable? 

3. Consider a sequence s G E^. Let 0(s) denote the orbit of s under the shift map a. 
Then the orbit of a sequence s G E^ is said to be homoclinic to 0(s) if 

lim d{(T^ (s) , O (s)) — 0. 

(a) Prove that any periodic sequence s has a countable number of homoclinic orbits. 

(b) Is this true for nonperiodic sequences s? 

(c) Can a periodic sequence have an infinite number of homoclinic orbits? 

4. Consider two sequences s, s G E^. Then the orbit of any point s G E"^ is said to be 
heteroclinic to 0{s) and 0{s) if 

lim d((T^(s), 0(s)) — 0 and lim d({7^(s), 0(s)) — 0. 

(a) Prove that any periodic sequences s, s G E^ , have a countable number of 
heteroclinic orbits. 

(b) Is this true for nonperiodic sequences? 

(c) Can two periodic sequences have an infinite number of heteroclinic orbits? 
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5. Show how the homoclinic orbits in 2^ can be characterized as the limit of a sequence 
of periodic orbits with increasing periods. 

6. Give a direct proof that is totally disconnected. (Hint: using the definition of 

neighborhood given in (24.1.4), choose any two sequences, s, s G 2^, and construct 
neighborhoods of each, denoted {s), that satisfy (s) {s) — 0 and 

7. This exercise is concerned with the classical Cantor “middle-thirds” set. This set is 
constructed inductively as follows: Begin with the unit interval, denoted Co for nota- 
tional purposes, and remove the open interval (^, §) called the “middle third”. We 
refer to the remainder as Ci. Thus 
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(b) C is compact. 

(c) C is closed. 

(d) C is uncountable. 

(e) C is perfect. 

(f) C is totally disconnected. 

(g) C has Lebesgue measure zero. 
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The Conley— Moser Conditions, 
or “How to Prove That a 
Dynamical System is Chaotic” 



In this section we will give sufficient conditions in order for a two- 
dimensional invertible map to have an invariant Cantor set on which the 
dynamics are topologically conjugate to a full shift on N symbols {N > 2). 
These conditions were first given by Conley and Moser (see Moser [1973]), 
and we give slight improvements on their estimates. Alekseev [1968], [1969] 
developed similar criteria. Generalizations to n-dimensions can be found 
in Wiggins [1988] and Li and Wiggins [1997]. Generalizations to nonau- 
tonomous systems can be found in Wiggins [1999]. 



25.1 The Main Theorem 

We begin with several definitions. 

Definition 25.1.1 A /i„-vertical curve is the graph of a function x = v{y) 
for which 

0 < v{y) < 1, \v{yi) - v{y 2 )\ < p.v\yi - J/ 2 | for 0 < yi, 7/2 < 1- 

Similarly, a /i;, -horizontal curve is the graph of a function y = h{x) for 
which 

0 < h{x) < 1, \h{xi) — h{x 2 )\ < yh\xi — X 2 I for 0 < x±, X 2 < 1; 
see Figure 25.1.1. 



We make the following remarks concerning Definition 25.1.1. 
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x=v(y) 




FIGURE 25.1.1. 

Remark 1. Functions x = v{y) and y = h{x) satisfying Definition 25.1.1 are 
called Lipschitz functions with Lipschitz constants /i„ and Hh, respectively. 

Remark 2. The constant pLu can be interpreted as a bound on the slope of 
the curve defined by the graph oi y = h{x). A similar interpretation holds 
for and the graph oi x = v{y) . 

Remark 3. For y,y = 0, the graph of x = v{y) is a vertical line and, for 
yh = 0, the graph oi y = h{x) is a horizontal line. 

Remark 4- At this point we have put no restrictions on the relationship or 
magnitudes of and yu- 

Next we want to “fatten up” these /Xt,-vertical curves and /r?i-horizontal 
curves into /u„-vertical strips and /x/j-horizontal strips, respectively. 



Definition 25.1.2 Given two nonintersecting yy-vertical curves vi{y) < 
i’ 2 (y); y G [0, 1], we define a ^^-vertical strip as 

V = {(x,y) G |a; G [vi{y),V 2 iy)]; y G [0,1]}. 



Similarly, given two nonintersecting yn-horizontal curves hfix) < h 2 {x), 
X G [0, 1], we define a /i/i -horizontal strip as 

H = {{x,y) e'Efi\y e [hi{x) , h 2 {x)]; x G [0,1]}; 



see Figure 25.1.2. The width of horizontal and vertical strips is defined as 
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d(H)= max \h 2 (x) — hi(x)\, 

x€[0,l] 

d{V)= max\v 2 {y)-vi{y)\. 
ye [0.1] 



(25.1.1) 

(25.1.2) 



x=v,(y) x=V 2 (y) 




FIGURE 25.1.2. 

The following two lemmas will play an important role in the inductive 
process of constructing the invariant set for the map /. 



Lemma 25.1.3 i) If D D ■■■ D D ■■■ is a nested sequence of 
yv-vertical strips with d{V^) 0 as k ^ oo, then ® 

pLy-vertical curve. 

ii) If D D • • • D D • • • is a nested sequence of yh-horizontal 
strips with d{H^) — >■ 0 as fc — >■ oo, then H^i = H°° is a yu-horizontal 
curve. 

Proof: We will prove i) only, since the proof of ii) requires only trivial 
modifications. 

Let denote the set of Lipschitz functions with Lipschitz con- 

stant pLv defined on the interval [0, 1]. Then with the metric defined by the 
maximum norm, [0, 1] is a complete metric space (see Arnold [1973] for 
a proof). Let x = Vi{y) and x = v^iy) form the vertical boundaries of the 
/x„-vertical strip V^. Now consider the sequence 

{^>1 (y) , (y) , vj (y) ,vl{y),---,v^{y),v^{y),---}. (25. 1.3) 



By definition of the U*, (25.1.3) is a sequence of elements of C^„[0, 1], and 
since d{V^) — >■ 0 as /c — >■ oo, it is a Cauchy sequence. Therefore, since 
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is a complete metric space, the Cauchy sequence converges to a 
unique /i„- vertical curve. This proves i). □ 

Lemma 25.1.4 Suppose 0 < < 1. Then a pLy-vertical curve and a 

Hh-horizontal curve intersect in a unique point. 

Proof: Let the /x/j-horizontal curve be given by the graph of 

y = h{x), 

and let the ^„-vertical curve be given by the graph of 

X = v{y). 

The condition for intersection is that there exists a point (x, y) in the unit 
square satisfying each relation, i.e., we have 

y=h{x), (25.1.4) 

where x in (25.1.4) satisfies 

X = v{y), 

in other words, the equation 

y = h{v{y)) (25.1.5) 

has a solution. We want to show that this solution is unique. We will use 
the contraction mapping theorem (see Arnold [1973]). 

Let us give some background. Consider a map 

g-.M — > M, 

where M is a complete metric space. Then g is said to be a contraction 
map if 

|g(mi) - 5 (to 2)| < /cjmi - 7712 ], mi,m 2 &M, 

for some constant 0 < fc < 1, where | • | denotes the metric on M. The 
contraction mapping theorem says that g has a unique fixed point, i.e., 
there exists one point fh G M such that 

g(jh) = fh. 
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We now apply this to our situation. 

Let / denote the closed unit interval, i.e., 

I={y€R^\0<y<l}. 

Clearly / is a complete metric space. Also, it should be evident that 

hov.I — >1. (25.1.6) 

Hence, if we show that /i o r; is a contraction map, then by the contraction 
mapping theorem (25.1.6) has a unique solution and we are done. This is 
just a simple computation. For j/i,y 2 G d we have 

\h{v{.yi)) - h{v(y2))\ < y.h\v{jji) - v{y2)\ 

< - 2/2 I- 

Since we have assumed 0 < < 1, h o v is a, contraction map. □ 

With these technical ideas and results established we can now turn to 
the main business of this section. We consider a map 

where D is the unit square in i.e., 

H = {(x, y) G I 0 < cc < 1, 0 < y < l}. 

Let 

S = {1,2,---,N}, (iV>2), 

be an index set, and let 

be a set of disjoint /r/i-horizontal strips. Finally, let 

z=l,---,/V, 

be a set of disjoint /r^-vertical strips. Suppose that / satisfies the following 
two conditions. 




590 



25. The Conley-Moser Conditions 



Assumption 1. 0 < < 1 and / maps Hi homeomorphically onto 

if (Hi) = Vi) for i = 1, • • • , N. Moreover, the horizontal boundaries of 
map to the horizontal boundaries of Vi and the vertical boundaries of 
map to the vertical boundaries of Vi. 

Assumption 2. Suppose H is a, /ift,-horizontal strip contained in 
Then 

f-\H)nH, = Hi 

is a /i^-horizontal strip for every i G S. Moreover, 

d{Hi) < Vhd{H) for some 0 < r-zj < 1. 

Similarly, suppose M is a /i„-vertical strip contained in Then 

f{V) AVi = V 

is a /i„-vertical strip for every i € S. Moreover, 

d{Vi) < Vyd{V) for some 0 < < 1. 

Now we can state our main theorem. 



Theorem 25.1.5 Suppose f satisfies Assumptions 1 and 2. Then f has 
an invariant Cantor set, A, on which it is topologically conjugate to a full 
shift on N symbols, i.e., the following diagram commutes 



A 



A 



4 > f 4-0 




where is a homeomorphism mapping A onto . 

The proof has four steps. 

Step 1. Construct A. 

Step 2. Define the map </>: A — > 



tq tq 
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Step 3. Show that </> is a homeomorphism. 
Step 4. Show that (j) o f = a o (j). 



Proof: Step 1 : Construction of the Invariant Set. The construction of the 
invariant set of the map is very similar to the construction of the invariant 
set for the Smale horseshoe in Chapter 23. We first construct a set of 
points that remains in IJjgg Vi under all backward iterates. This will turn 
out to be an uncountable infinity of y^t,-vertical curves. Next we construct 
a set of points that remains in IJjgg Hi under all forward iterates. This 
will turn out to be an uncountable infinity of /i^-horizontal curves. Then 
the intersection of these two sets is clearly an invariant set contained in 

The reader may wonder why our terminology here is different than that 
used in the discussion of the construction of the invariant set for the Smale 
horseshoe. In that case the invariant set, A, was given by 

OO 

A= n riD)- 



However, for the Smale horseshoe we knew how the map acted on all of 
D. Namely, the part of D not contained in Hq U Hi was “thrown out” 
of D under the action of /. We have not assumed such behavior in the 
situation presently under consideration. We know only how the map / acts 
on UieS UieS comment more on this 

following the proof of the theorem. 

We begin by inductively constructing the set of points in UieS ^ 
remain in IJieS under all backwards iterations by /. We denote this set 
by A_oo and A_„, n = 1, 2, • • • denoting the set of points in IJjgg Vi that 
remain in IJjgg Vi under n — 1 backwards iterations by /. 

In the following arguments we will repeatedly nse the following set the- 
oretic identities 

(U^*)n(U^0= U 

\iex / \jej / ieijGj 

where I and J are index sets for the sets Ai and Bj, respectively. Also, 
for a function f : A ^ B, with subsets Ai, A 2 G A and Bi, B 2 G B, 
we have 
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/(AiU^2) = /(^i)U/(A2), 

/-1(BiUB2) = /-'(Bi)U/-1(B2), 

f{Ai n A 2 ) = f{Ai) n /(A 2 ), requires f to be 1-1, 

/-i(BinB2) = r'(Bi)n/-i(B2). 



The reader can find proofs in virtually any set theory or topology text- 
book. 



A_i. A_i is obvious. 

A_i= IJ 

s — i^S 



A_ 2 . It should be clear that 



a_ 2 = /(A_i) n 




(25.1.7) 



(25.1.8) 



is the set of points in Us_ieS mapped into A_i under / ^ 

Then, using (25.1.7), (25.1.8) becomes 



A_2 = 



U n 



k S — 2^S 




= u fiVs. 2 )nVs_,= U 

s—i^S s — i^S 

z=l,2 2=1,2 



(25.1.9) 



We note the following. 



i) Vs_,s .2 = {P & D\p eVs_,,f ^(p) G with 1 A_is _2 C lA_i- 

ii) It follows from Assumptions 1 and 2 that IA_is_ 2 ? ^-i ^ S, i = 
1, 2, is /x„-vertical strips with N of them in each of the Vi, i G 
S. Note that there are iV^ sequences of length two that are made 
up of elements of S and that the 14_is_2 can be put in one-to-one 
correspondence with these sequences. 

iii) It follows from Assumption 2 that 



d{Vs_\s_2) — ^vd(lA_,) ^ Vv 



(25.1.10) 
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A_ 3 . We construct A _3 from A _2 as follows 



A-3 = /(A_ 2 ) n 




(25.1.11) 



Hence, (25.1.11) is the set of points in that are mapped into 

A _2 under f~^. Using (25.1.9), (25.1.11) becomes 



A-3 = f 



( 



I s—i 

\ z=2,3 



\ 

n 


' U 


/ 


\s-ias / 



= u fiVs.Jnf{Vs_JnVs_, 

s-tes 

i=l,2,3 

= U ^S-lS_2S-3J 



(25.1.12) 



S-iGS 

i=l,2,3 



noindent where we have the following. 



i) Vs_^s_ 2 s_^ = {p ^ D\p G Vs_^J ^{p)GVs_^jf ^(p) G T^_ 3 } with 
hs_iS_2S-3 ^ h^S_iS_2 ^ 

ii) It follows from Assumptions 1 and 2 that Ug_is_ 2 s_ 3 , S-i € S, i = 
1,2,3, is /Lt„-vertical strips with N'^ of them in each of the V), 
i G S. Note that there are sequences of length three made up 
of elements of S and that the I4_js_2s_3 can be put in one-to-one 
correspondence with these sequences. 

iii) It follows from Assumption 2 that 

d(v;_33_2._3) < < vld{Vs_,) < vl. (25.1.13) 



This procedure can be carried on indefinitely. At the {k + 1)*^ step we 
have 



A-fc-i = / (A_fe) n j [J 

k 






= f 



\ 



u P”'(v;_._Jn---n/(U 3 _ 3 )nv ;_2 



I s-i£S 
\z=2,---,fc+l 



) 
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s-i^S 

= U (25.1.14) 

s-iGS 

where we have the following. 

i) Vs_-,...s_k-i = {p G D\f-^+^{p) G Vs_i,i = l,2,...,fc+l} with 

C Vs_i..-S-k C C Vs_^S-2 *— ^S_1- 

ii) It follows from Assumptions 1 and 2 that S-i G S, i = 

1, • • • , A: + 1, is y^^-vertical strips with of them in each of 

the Vi, i £ S. Note that there are sequences of length k + 1 

constructed from elements of S and that these sequences can be put 
in one-to-one correspondence with the 

iii) From Assumption 2 it follows that 

^(^S_i...S_fc_i) ^ k'yd{Vs_i..-S-k) — + l ) 

< i'vd{Vs_,...s_^+2) <■■■< }^vd{Vs_,) 

< (25.1.15) 

It follows from Assumptions 1 and 2 that in passing to the limit as fc — >■ oo 
we obtain 

A_oo= U ■■■nf{Vs_,_Jn---nf{Vs_,)nVs_, 

s-i^S 

= U ^-1 (25-1-16) 

s-tGS 

which, from Lemma 25.1.3, consists of an infinite number of /it,-vertical 
curves. This follows from the fact that given any infinite sequence made 
up of elements of S, say 

S-lS-2 • • • S-k • • • , 



we have (by the construction process) an element of A_oo which we denote 

by 
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Vs-iS-2---S-k---- 

Now, by construction, is the intersection of the following nested 

sequence of sets 

D • • • D D • • • , 

where from (25.1.15) it follows that 

— >■ 0 as fc — >■ oo. 



Thus, by Lemma 25.1.3, 

OO 



k=l 



consists of a /i„-vertical curve. It also follows by construction that 

K_i...s_,... = {p€D\ n+\p) G i = 1, 2, • • •}. (25.1.17) 



We next construct Aqo, the set of points in UieS'^* remain in 

Uies under all forward iterations by /. We denote the set of points 
that remain in IJjgg Hi under n iterations by / by A„. Since the construc- 
tion of Aoo is very similar to the construction of A_oo we will leave out 
many details that we explicitly noted in the construction of A_oo. 



We have 



Ao= U ff.o- 

Sq^S 



(25.1.18) 



Ai. 



Ai = /-i(Ao)n ( U 77, J 

\SqGS / 



(25.1.19) 



is the set of points in IJsoeS map into Aq under /. Therefore, 

using (25.1.18), (25.1.19) becomes 



Ai = rM U U ^^0 

\siGS / \soGS / 

= U ri(77,jn77,„^ U 77„ 



(25.1.20) 



SiGS 

i=0,l 



SiGS 

i=0,l 



where we have the following. 
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i) Hsosi = {p & D\p e Hso,f{p) e Hs,}. 

ii) It follows from Assumptions 1 and 2 that Ai consists of ph- 
horizontal strips with N of them in each of the Hi, i G S. Moreover, 
there are N'^ sequences of length two made up of elements of S and 
these can be put in one-to-one correspondence with the i?sosi- 

iii) From Assumption 2 we have 

d{Hs,s,)<Vhd{Hs,)<Vh- (25.1.21) 



Continuing the construction in this manner and repeatedly appealing to 
Assumptions 1 and 2 allows us to conclude that 



Afc = n U ^^0 

\soes / 

Si^S 

= U (25.1.22) 

SiGS 

consists of /i/j -horizontal strips with of them in each of the Hi, 

i G S. Moreover, 

d(i?«o-.j < 



It should also be clear that 



Hs,...s, = {pGD\f{p)GHs„i = 0,l,---,k} 

and that there are sequences of length fc -|- 1 made up of elements of 

S which can be put in a one-to-one correspondence with the Hsg...si,- 

Thus, in passing to the limit as k ^ oo, we obtain 



Aoo= U ■■■nr'^{H,,)n---nf-\Hs,)nH,„ 

Si^S 
2=0, 1,-" 

= U Hs,s,-s,- (25.1.23) 

SiGS 
1=0, 1,--- 



and 
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Hs,s^-s,- = {p€D\ rip) = (25.1.24) 

By Lemma 25.1.3, Aqo consists of an infinite number of /i?i-horizontal 
curves. This follows from the fact that given any infinite sequence made up 
of elements of S, say 

■ * * S/j; * * * , 



the construction process implies that there is an element of Aqo which we 
denote by 



Now, by construction, Hsasi---sk--- is the intersection of the following nested 
sequence of sets 

Hsq T) Hggg-^ D • • • D iLsosi-'-sj, T) • • • , 

and from (25.1.23) it follows that 

d{Hsgsi-sk) ^ 0 as k^oo. 



Hence, by Lemma 25.1.3, Hsgsi---sk--- is a /i/j-horizontal curve. 

It follows that an invariant set, i.e., a set of points that remains in D 
under all iterations by / is given by 



A — {A_oo Aqo} ^ 




C D. 



Moreover, by Lemma 25.1.4, since 0 < PvPh < 1, A is a set of discrete 
points. It should be clear that A is uncountable, and shortly we will show 
that it is a Cantor set. 



Step 2: The Definition of (():A — >• Choose any point p G A; then by 

construction there exist two (and only two) infinite sequences 



* * * 5 

S_iS_2 * * * S—k * * * 7 



Si € S, f = 0, ±1, ±2, • • • , 



such that 



P = K_i 






(25.1.25) 



We thus associate with every point p G A a, bi-infinite sequence made up 
of elements of S, i.e., an element of as follows 
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p I (• • • S-k ■ ■ ■ S-I.SqSI ■■ ■ Sk- ■■) 



(25.1.26) 



where the bi-infinite sequence associated with p, (j>{p), is constructed by 
concatenating the infinite sequence associated with the /Xt,-vertical curve 
and the infinite sequence associated with the /i/j-horizontal curve whose 
intersection gives p, as indicated in (25.1.25). Since a /i/j -horizontal curve 
and a /r„-vertical curve can only intersect in one point (for 0 < PvPh < 1 
by Lemma 4.3.2), the map 4> is well defined. 

Now recall from (25.1.17) that 

K„,s_,...s_,... = [p&D\r^+\p) G K_,,Z= 1,2,...}, (25.1.27) 

and by assumption we have 



/(i7,J = 1^,,; 



thus, (25.1.27) is the same as 

= {p^D\ f-\p) G = 1,2,- • •}. (25.1.28) 

Also, by (25.1.24) we have 

= {p€D\r{p)€Hs„i = 0,l,2,---}. (25.1.29) 

Therefore, from (25.1.26), (25.1.28), and (25.1.29) we see that the bi-infinite 
sequence associated with any point p G A contains information concerning 
the behavior of the orbit of p. In particular, from (j){p) we can determine 
which Hi, i G S contains f^{p), i.e., f^{p) G Hg,.. 

Alternatively, we could have arrived at the definition of </> in a slightly 
different manner. By construction, the orbit of any p G A must remain in 
Hence, we can associate with any p G A a bi-infinite sequence 
made up of elements of S, i.e., an element of , as follows 

P (f){p) = {• • • S_fc • • • S_i.SoSi • • • Sfc • • •} 

by the rule that the element of the sequence <t){p) is chosen to be the 
subscript of the Hi, i G S, which contains f^{p), i.e., /^(p) G Hg,.. This 
gives a well-defined map of A into since the Hi are disjoint. 
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Step 3: (j) is a Homeomorphism. 

We must show that (j) is one-to-one, onto, and continuous. Continuity 
of (f>~^ will follow from the fact that one-to-one, onto, and continuous 
maps from compact sets into Hausdorff spaces are homeomorphisms (see 
Dugundji [1966]). The proof is virtually the same as for the analogous situ- 
ation in Theorem 23.4.1; however, continuity presents a slight twist so, for 
the sake of completeness, we will give all of the details. 



4> is One-to-One. 

Hp')- 



This means that given p,p' G A, if p yf p', then 4>{p) ^ 



We give a proof by contradiction. Suppose p ^ p' and 

(j>(p) = (j){j)') = {• • • S-n • • • S-i.So • • • • • •}, 



then, by construction of A, p and p' lie in the intersection of a p.t,-vertical 
curve lA_j...s_„... and a p./i-horizontal curve .«„.... However, by Lemma 
25.1.4, the intersection of a p/j-horizontal curve and a p^,-vertical curve 
consists of a unique point; therefore, p = p' , contradicting our original 
assumption. This contradiction is due to the fact that we have assumed 
4>{p) = (f){p')] thus, for p ^ p', </>(p) ^ (j>{p'). 



This means that, given any bi-infinite sequence in S^, say 
{• • • s_„ • • • s_i.So • • • s„ • • •}, there is a point p G A such that 4>{p) = 

{• • • S-n ■ ■ ■ S-I-So • • • Sn • • •}• 

The proof goes as follows. Choose {• • • s_fc • • • s_i.soSi • ■ • Sfc • • •} G 

Then, by construction of A = A_oo H Aqo, we can find a p^-horizontal 

curve in Aqo denoted Hsosi---Sk--- ^tnd a pi,-vertical curve in A_oo denoted 
Vs_is^2--s-k-- ■ Now, by Lemma 4.3.2, Hsosi---sk--- and K_i...s_j,... intersect 
in a unique point p G A and, by definition of </>, the sequence associated 
with p, (j>{p), is given by {• • • s_fc • • • S-i.SqSi • • • Sfc • • •}. 



This means that given any point p G A and e > 0, we 
,p) such that 

— p'l < i5 implies d{4>{p), (j){p')) < e, 

where j • j is the usual distance measurement in and d(-, •) is the metric 
on introduced in Section 24.1. 



(j) is Continuous, 
can find a, 5 = 5{e 

\P 



(j) is Onto. 
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Let e > 0 be given; then, by Lemma 24.1.2, if we are to have \(j){p) — 
4>{p')\ < £ there must be some integer N = N{e) such that if 

(j){p) = |...s_„...s_^.sp...s„...}, 

= {. . . . . . 4 . . .}, 

then Si = s\, i = 0, ±1, • • • , ±Ai. Thus, by construction of A, p and p' 
lie in the set defined by Hsq...sn LI Vs_i.-s-n- We denote the ^^-vertical 
curves defining the boundary of TA_i...s_„ by the graphs of x = vi{y) 
and X = V 2 {y)- Similarly, we denote the /i/j-horizontal curves defining the 
boundary of Hgg...sN by the graphs of y = hi{x) and y = /i 2 (x); see Figure 
25.1.3. Note from (25.1.23) and (25.1.15) that we have 



d{Hs„...s^) < Vh, 


(25.1.30) 




(25.1.31) 


Hence, from Definition 25.1.2 we have 


m_ax_ \vi{y) - V 2 (y)\ = ||ui - U 2 II < \ 


(25.1.32) 


yeio.i] 


max \hi(x) — /i 2 (x)| = \\hi — /12 1 < ■ 

x£[0,l] 


(25.1.33) 




1 



FIGURE 25.1.3. 

The following lemma will prove useful in proving continuity of 4>. 

Lemma 25.1.6 

\xi - X 2 I < y— [||wi - U 2 II + pLvWhi - / 12 II] , (25.1.34) 

i flyflh 

\vi - J/ 2 | < [11^1 - h 2 \\ + fj-h\\vi - f 2 ||]. (25.1.35) 
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Proof: This follows from the following simple calculations. 

\xi - X 2 \ = \vi{yi) - V 2 {y 2 )\ 

< \vi{yi) - vi{y2)\ + \viiv2) - V2{y2)\ 

< y^vlyi - V 2 \ + \\vi - V 2 W (25.1.36) 



and 

\yi - 2/2I = \hi{xi) - h2{x2)\ 

< \hi{xi) - hi{x 2 )\ + \hi{x 2 ) - h 2 {x 2 )\ 

< - a;2| + 11^1 - /i2||- (25.1.37) 

Substituting (25.1.37) into (25.1.36) gives (25.1.34), and substituting 
(25.1.36) into (25.1.37) gives (25.1.35). Note that these algebraic manipu- 
lations require 1 — > 0. This proves Lemma 25.1.6. □ 

Now we can complete the proof that </> is continuous. Let pi denote the 
intersection of the graph of hi (x) with vi (y) and p 2 denote the intersection 
of the graph of h 2 {x) with V 2 {y)- Now it follows that 

\p-p'\<\Pi-P2\- (25.1.38) 



We denote the coordinates of pi and p 2 by (xi,yi) and {x 2 , j/ 2 ), respectively. 
Using (25.1.38), we obtain 

\p-p'\ < \xi - X 2 I -h \yi - V 2 \, (25.1.39) 



and using Lemma 25.1.6, we obtain 

\xi - X 2 I + \yi-y2\< [(1 + hh)\\vi - U 2 II 

+ {l + p^)\\hi-h2\\]. (25.1.40) 

Using (25.1.39), (25.1.40), (25.1.32), and (25.1.33) we obtain 

\p-p'\ < [{^ + + {^ + Pv)’^h]- 

1 - PvPh 

Hence, if we take 

S = [(1 -I- Ph)^v ^ -I- (1 -I- llv)l^h] , 



continuity is proved. 
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Step 4 .: (j) o f = a o 4>. Choose any p G A and let 

4>{p) = {• • • s_fe • • • s_i.soSi • • • Sfe • • •}; 

then 

a o (j){p) = {• • • s_fc • • • s-iso-si • • • Sfc • • •}. (25.1.41) 

Now, by definition of </>, it follows that 

4> o f{p) = {• • • s-fc • • • s_iSo.si • • • Sfc • • •}; (25.1.42) 

hence, from (25.1.41) and (25.1.42), we see that 

^ o f{p) = 0-0 Hp) 

and p is arbitrary. This completes our proof of Theorem 25.1.5. □ 



25.2 Sector Bundles 

In Chapters 26 and 27 we will see that certain orbits called homoclinic orbits 
give rise to the geometrical conditions that allow Assumption 1 to hold in 
a two-dimensional map. However, a direct verification of Assumption 2 is 
not easy. When one thinks of stretching and contraction rates of maps, it is 
natural to think of the properties of the derivative of the map at different 
points. We now want to derive a condition that is equivalent to Assumption 
2 that is based solely on properties of the derivative of / (hence we must 
assume that / is at least C^). We begin by establishing some notation. 

We define 

= (25.2.1) 

and 

H, n r\H,) = = r\Vj,) (25.2.2) 

for i,j G S where S = A^} (iV > 2) is an index set; see Figure 

25.2.1. We further define 

H= \J H,, 

i,j€S 



and 
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V2=f(H2> V,=f(H,) 




FIGURE 25.2.1. N = 2 for illustrative purposes. 

v= U 

i,j€S 



It should be clear that 

fin) = V. 

We now want to strengthen our requirements on / by assuming that / 
maps H diffeomorphically onto V. 




FIGURE 25.2.2. 

For any point zq = (xq, yo) G "H IJ V, we denote a vector emanating from 
this point by {^zoiVzo) ^ s®® Figure 25.2.2. We define the stable sector 

at Zq as follows 



^Zo ~ {ii^o^Vzo) G I l^zol — l^h\^zo\} ■ 



(25.2.3) 
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Geometrically, defines a cone of vectors emanating from zq, where /ih 
is the maximum of the absolute value of the slope of any vector in the 
cone and slope is measured with respect to the a;-axis; see Figure 25.2.3. 
Similarly, the unstable sector at Zg is defined as 

5“ = G K' I l6ol < l^v\v.o\}- (25.2.4) 

Geometrically, 5“^ defines a cone of vectors emanating from zq, where 
is the maximum of the absolute value of the slope of any vector in the cone 
and slope is measured with respect to the y-axis; see Figure 25.2.3. We will 
put restrictions on fj,y and Hh shortly. 




FIGURE 25.2.3. 

We take the union of the stable and unstable sectors over points in TL 
and V to form sector bundles as follows 

Sv=[J 

[J 

zoGH 

Sv=lj 

zoev 

We refer to as the stable sector bundle over "H, as the stable sector 
bundle over V, 5)^ as the unstable sector bundle over H, and as the 
unstable sector bundle over V. 



Now we can state our alternative to Assumption 2. 
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Assumption 3. Df{S^) C Sy and Df ^(Sy) C S^. 

Moreover, if (6o:»7^o) G and D/(zo)(U> = i^f{zo),Vf{zo)) G 
^f(zo)’ have 

\Vf{zo) \ > \Vzo\- 

Similarly, if {^zo,Vzo) G and Df~\zo){^zo,Vzo) = i^f-Hzo),Vf-^zo)) G 
^f-Hzo)^ then 

\^f~Hzo) \ > \^zo\, 

where 0 < /x < 1 — pLyt^h', see Figure 25.2.4. We remark that the notation 
Df(S^) C 5y is somewhat abbreviated. More completely, it means that for 
every zq G H, {^zo,Vzo) G 5“^, we have Df{zo){^zo,Vzo) = {^f(zo),Vf(zo)) G 
similarly for Df~^{Sy) C S^. We now state the main theorem. 



D«znl n't 




FIGURE 25.2.4. 



Theorem 25.2.1 If Assumptions 1 and 3 hold with 0 < /x < 1 — pivP-h, 
then Assumption 2 holds with Vh = Vy = m/(1 ~ P'vP-h)- 
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Proof: We will prove only the part concerning horizontal strips; the part 
concerning vertical strips is proven similarly. The proof consists of several 
steps. 

Step 1. Let H he & /i?i-horizontal curve contained in Then show 

n Hi = Hi is a /i^-horizontal curve contained in Hi for all 

i € S. 

Step 2. Let iL be a /ift,-horizontal strip contained in Then use 

Step 1 to show that f~^{H) (1 Hi = Hi is a ^ji-horizontal strip for 
each i G S. 

Step 3. Show that d{Hi) < (/r/(l — d{H). We begin with Step 1. 

Step 1. Let H C UjeS ^3 ^ -horizontal curve. Then H intersects both 

vertical boundaries of each Vi \/ i G S. Hence, f~^{H) D Hi is a curve for 
each i G S' by Assumption 1. 

Next we argue that f~^{H) (1 Hi is a ^^-horizontal curve Vi G S. 

This follows from Assumption 3, since Df~^ maps Sy into Let 
(xi, j/i), (x 2 ,j/ 2 ) be any two points on f~^{H) iT Hi, i fixed; then by the 
mean value theorem 

\yi - V2\ < p.h\xi - X2I. 

Thus, f~^{H) n Hi is the graph of a ^/j-horizontal curve y = h{x). 

Step 2. Let H C U^eS ^ /ift,-horizontal strip. Then applying Step 1 to 

the horizontal boundaries of H shows that f~^{H) D Hi is a /i/j-horizontal 
strip for every i G S. 

Step 3. Fix i and choose points po and pi on the horizontal boundary of 
Hi having the same x-components such that 

= bo-Pi|. (25.2.5) 

Consider the vertical line connecting po and pi defined as follows 
p{t) = tpi + {1 - t)po, 0 < t < 1; 

see Figure 25.2.5. Then it should be obvious that 

p{t) = Pi — Po G Sfi VO < t < 1. 
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FIGURE 25.2.5. 

Next we consider the image of p{t) under /, which we denote by 
f{p{t)) = z{t) = {x{t),y{t)), 0 < t < 1. 

It should be clear the z{t) is a curve connecting the two horizontal bound- 
aries of as shown in Figure 25.2.5. We denote the endpoints of the curve 
by 

/(p(0)) = Zo = (xo,yo) 



and 

/(p(l)) = zi = (xi,yi). 



Moreover, since H is /i^i-horizontal, Zq lies on a /x/j-horizontal curve that 
we denote by y = ho{x) and z\ lies on a -horizontal curve that we denote 
by y = h\{x). Tangent vectors to z{t) are given by 

z{t) = Df{p{t))p{t). (25.2.6) 



Using (25.2.6) and the fact that from Assumption 3 Df(S^) C we 
conclude that z{t) is a /Xt,-vertical curve. Therefore, applying Lemma 25.1.6 
to z{t), y = ho{x), and y = hi{x) we obtain 



1 

1 - IJ^vPh 



11^0 — ^i|| 



1 

1 - pivp^h 



d{H). 



\yo - yi\ < 



(25.2.7) 
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Also by Assumption 3 we have 

\y{t)\ > -\p{t)\ = -\pi - Pol- (25.2.8) 

Integrating (25.2.8) gives 

\pi-Po\<p[ \y{t)\dt < n\yi - yo\- (25.2.9) 

Jo 

Finally, (25.2.7) and (25.2.9) along with (25.2.5) give 
rf(i^^) < ^ d{H). □ 

1 - PvPh 



25.3 Exercises 



1. Prove Eq. (25.1.38). 

2. This problem was first studied in detail by Philip Holmes (see Guckenheimer and 
Holmes [1983] for additional references). We consider the mechanical system consisting 
of a small ball bouncing vertically on a massive vibrating table, where each impact is 
governed by the relationship 

V{tj) - W{tj) = -a (C/fe) - W{tj)) 



where U, V, and W are, respectively, the absolute velocities of the approaching ball, 
the departing ball, and the table, 0 < a < 1 is the coefficient of restitution, and 
t — tj is the time of the impact. We assume that the distance the ball travels 
between impacts under the influence of gravity , p, is large compared with the overall 
displacement of the table, then the time interval between impacts is approximated by 



4-1 t -j — 



2T(C 



and the velocity of approach at the (j + 1 )®* impact is 

c/(G+i)^-y(G)- 



Combining these relationships, we obtain a recurrence relationship relating the state 
of the system at the {j + 1 )®^ impact to that at the of the following form: 

2Vi 

tj + i — tj H , 

9 

2V4 

y, + i = aVj + (1 + a)W{ti + — 

9 



where we use the notation V{tj) = Vj^ etc. 

Assuming that the table motion is sinusoidal of the form — /3sino;t, and nondimen- 
sionalizing, the map, which we henceforth refer to as /, takes the form 



/ 4>j+i — 4>j + '<^j 
X vj + 1 = avj - 7 cos{4>j + Vj) 



(25.3.1) 



where (p — ujt^ v — , and 7 — (1 + a)P/g. 
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(a) Verify that the inverse of the map is given by 

r-i . / ^3-1 =<^3 - h ( 7 COS 03 + Vj) 

\ ^3-3 = 5 (7COS0J + Vj) 



(25.3.2) 



(b) Verify that the map is area preserving for oc — 1 and contracts areas uniformly 
for Q; < 1 . 

(c) Prove the following lemma, which establishes assumption 1 of the Conley-Moser 
conditions. 




FIGURE 25.3.1. 

Lemma 25.3.1 For a — 1 and 7 > 47 t one can find horizontal and vertical 
strips Hi, Vi such that f{Hi) — Vi, i — 1,2. Moreover horizontal boundaries 
of Hi map to horizontal boundaries of Vi and vertical boundaries of Hi map 
to vertical boundaries of Vi, i — 1,2. 

Hint: In Fig. 25.3.1 consider the parallelogram ABCD bounded by the lines 

0 + 17 — 0 (-^B) 

0 + 17 — 27T ( CD) 

0-0 (AD) 

0 - 27t (BC) 

The parallelogram is foliated by the lines cj) v — k, k ^ [0,27t]. Show that 
the image of such lines under f are the vertical lines (p — k, v ^ [^ ~ — 

7 cos/t,/t — 7 cos fc] . The images of the boundaries 0 — 0 and 0 — 27t are the 
curves 17 — 0 — 7 cos 0, 17 — 0 — 27t — 7 cos 0. 

(d) Prove the following lemma, which establishes assumption 3 of the Conley-Moser 
conditions. 

Lemma 25.3.2 For 7 sufficiently large (5 tt is sufficient), there are sector 
bundles S^{p), S^{p) based at points p G IJi j = i 2 ^^^ ^ U) centered on the 
lines 0 — constant and cp v — constant, respectively, and each of angular 
extent such that Df{S^{p)) C (p) and Df~^{S^{p)) C S^{p). Moreover, 
Df{p) expands vertical distances by a factor of at least 5.5 and Df~^ expands 
horizontal distances by a factor of at least 4.5. 

Hint: let 



Df 



1 1 
r 1 + r 



D /-1 = 



f 1 + r -1 
\ -r 1 



) 



ly/iere r = 7 sin(0 + 17) or 7 sin 0, and see Fig. 25.3.2. 
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FIGURE 25.3.2. a) Behavior of unstable sectors under Df. b) Behavior of stable 
sectors under Df~^. 



(e) Use the previously proved results to establish the following theorem. 

Theorem 25.3.3 For 7 > Stt and a — 1 the map possesses an invariant, 
hyperbolic Cantor set A on which /|a is topologically conjugate to the shift 
on two symbols. 

(f) Give a physical description of the dynamics in the invariant Cantor set for this 
problem. 

3. This problem is concerned with the Henon map, 




the condition 



|a:| > A 



l+l^l 

2 



divides S into two vertical strips, Vi and V 2 , and the condition 
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\V\ > A 



l + |fe| 

2 



divides S into two horizontal strips, Hi and H 2 , which obey assumption 1 of 
the Conley-Moser conditions. 

(b) Consider the sectors 



~ {(^Zq,11zq) I IVzqI > -^I^ZqI} ■ 

Prove that if (25.3.4) holds then we can choose A > 2 so that in Vi and V 2 
is invariant under DH{x, y) and in the Hi and H 2 is invariant under 
DH~^ {x, y). Thus, conclude that assumption 3 of the Conley-Moser Conditions 
hold. 

(c) Prove that the Henon map has a hyperbolic invariant Cantor set on which it is 
topologically conjugate to the shift on two symbols. 

4. Horseshoes are Structurally Stable. Suppose that a map f:D—¥ satisfies the 
hypothesis of Theorem 25.2.1. Then it possesses an invariant Cantor set A. Show that, 
for e sufficiently small, the map f eg (with g C’’, t > 1, on D) also possesses an 

invariant Cantor set Ag. Moreover, show that Ag can be constructed so that {f-\-eg)\ 

lAg 

is topologically conjugate to /I . 
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Dynamics Near Homoclinic 
Points of Two-Dimensional 
Maps 



In Chapter 25 we gave sufficient conditions for a two-dimensional map 
to possess an invariant Cantor set on which it is topologically conjugate 
to a full shift on N symbols. In this section we want to show that the 
existence of certain orbits of a two-dimensional map, specifically, transverse 
homoclinic orbits to a hyperbolic fixed point, imply that in a sufficiently 
small neighborhood of a point on the homoclinic orbit, the conditions given 
in Chapter 25 hold. There are two very similar theorems which deal with 
this situation; Moser’s theorem (see Moser [1973]) and the Smale-Birkhoff 
homoclinic theorem (see Smale [1963]). We will prove Moser’s theorem and 
describe how the Smale-Birkhoff theorem differs. 

The situation that we are considering is as follows: let 




be a C’’ (r > 1) diffeomorphism satisfying the following hypotheses. 
Hypothesis 1. / has a hyperbolic periodic point, p. 

Hypothesis 2. W^{p) and W^{p) intersect transversely. 

Without loss of generality we can assume that the hyperbolic periodic 
point, p, is a fixed point, for if p has period k, then f^{p) = p and the 
following arguments can be applied to /^. A point that is in IT®(p) fl W“(p) 
is said to be homoclinic to p. If IT®(p) intersects IT“(p) transversely in a 
point, then the point is called a transverse homoclinic point. Our goal is to 
show that in a neighborhood of a transverse homoclinic point there exists 
an invariant Cantor set on which the dynamics are topologically conjugate 
to a full shift on N symbols. Before formulating this as a theorem, a few 
preliminary steps need to be taken. 
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Step 1: Local Coordinates for f. Without loss of generality we can assume 
that the hyperbolic fixed point, p, is located at the origin (cf. Section 3). 
Let [/ be a neighborhood of the origin. Then, in U, f can be written in the 
form 



C I — 1 AC + 5i(C,»7), 

p I — m + 92{£.,ri), 



if,v) G C/C 



(26.0.1) 



where 0 < |A| < 1, |^| > 1, and gi, g 2 are 0(2) in ^ and g. Hence, g = 0 and 
^ = 0 are the stable and unstable manifolds, respectively, of the linearized 
map. However, in the proof of the theorem we will find it more convenient 
to use the local stable and unstable manifolds of the origin as coordinates. 
This can be done by utilizing a simple (nonlinear) change of coordinates. 



We know from Theorem 3.2.1 that the local stable and unstable mani- 
folds of the hyperbolic fixed point can be represented as the graphs of C’’ 
functions, i.e.. 



Wi:,,(0) = graph /r^(e), 
Wi“,(0) = graph h%g), 



(26.0.2) 



where /i®(0) = /i“(0) 
transformation 



= Dh^{0) = H/i“(0) = 0. If we define the coordinate 
{x, y) = {f- lC{g),g - h\^)), (26.0.3) 



then (26.0.1) takes the form 



XI — 1 Ax -k fi{x,y), 

2 / 1 — > ny + f2{x,y), 



(26.0.4) 



with 



/i(0,2/) = 0, 

/2(x,0) = 0. 



(26.0.5) 



Equation (26.0.5) implies that y = 0 and x = 0 are the stable and unstable 
manifolds, respectively, of the origin. We emphasize that the transforma- 
tion (26.0.3) is only locally valid; therefore (26.0.4) has meaning only in a 
“sufficiently small” neighborhood of the origin. Despite the bad notation, 
we refer to this neighborhood, as above, by U. 



Step 2: Global Consequences of a Homoclinic Orbit. By assumption, 1T'*(0) 
and 1T“(0) intersect at, say, q. Then, since q G W®(0) fl W“(0), 

lim r{q) = 0, 

n—^oo 

lim r{q)=0. 



(26.0.6) 
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Therefore, we can find positive integers fco and k\ such that 

= qoeU, 
f-^^{q) = q,GU. 



(26.0.7) 



In the coordinates in U we denote 



qo = (a;o,0), 
qi = (o,yi)- 



It follows from (26.0.7) that 

f'^iqi) = Qo, 



where k = ko + ki; see Figure 26.0.1. 




FIGURE 26.0.1. 

Next we choose a region V as shown in Figure 26.0.1 with one side along 
IF'*(0) emanating from q, one side along IF“(0) emanating from q, and the 
remaining two sides parallel to the tangent vectors of IF®(0) and IU“(0) 
at q. Now V can be chosen to lie on the appropriate sides of IF®(0) and 
IF“(0) and taken sufficiently small so that 

r^^{v) = vi c u 
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and 

fo(y) = VoCU (26.0.8) 

appear as in Figure 26.0.1. From (26.0.8) it follows that we have 

= Vo. (26.0.9) 



Let us make an important comment concerning Figure 26.0.1. The im- 
portant aspect is that we can choose V and (large) positive integers ko and 
ki such that and are both in the first quadrant, and dis- 

joint. This can always be done and is left as one of the exercises at the end 
of this section. (Note: Certainly k depends on the size of U as U shrinks 
to a point k — 1 oo.) We remark that in Figure 26.0.1 we depict 1F'*(0) 
and VF“(0) as winding amongst each other. We will discuss the geometrical 
aspects of this more fully later on; for the proof of this theorem a detailed 
knowledge of the geometry of the “homoclinic tangle” is not so important. 
However, the one aspect of the intersection of IF^(O) and 1F“(0) at q that 
will be of importance is the assumption that the intersection is transversal 
at q. Since / is a diffeomorphism, this implies that 1F®(0) and VF“(0) also 
intersect transversely at f^°{q) = qo and f~^^{q) = qi (or, more generally, 
at f^{q) for any integer k). 

The next step involves gaining an understanding of the dynamics near 
the hyperbolic fixed point. 



Step 3: Dynamics Near the Origin. We first state a well-known lemma that 
describes some geometric aspects of the dynamics of a curve as it passes 
near the hyperbolic fixed point under iteration by /. 

Let q G 1F®(0) — {0}, and let C be a curve intersecting 1F®(0) transversely 
at q. Let denote the connected component of f^{C)r\U to which f^{q) 
belongs; see Figure 26.0.2. Then we have the following lemma. 



Lemma 26.0.4 (The lambda lemma) Given e > 0 and U sufficiently 
small there exists a positive integer Nq such that for N > Nq is 
e-close to kF“(0) fl U. 



Proof: Proofs can be found in Palis and de Melo [1982] or Newhouse [1980]. 
Here we give a proof for two-dimensional diffeomorphisms. 

Without loss of generality we can assume q G U. Moreover, we will take 
U of the form 
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FIGURE 26.0.2. 



where is an interval on the x axis containing the origin and ly is an in- 
terval on the y axis containing the origin. We denote the partial derivatives 
of fi and /2 by fix, fiy, f 2 x, f 2 y Since they are zero at the origin, for U 
sufficiently small we can find a constant k such that 

1> k> sup||^/la:|, \fly\, \f2x\, |/2y|| , (26.0.10) 



and 



|A| 1-Hfc |A| 1 + k 
|/i| 1 — fc \fj,\ 1 — 5k 
1 < |/r| — 2k, 



(26.0.11) 

(26.0.12) 



1^1-* (i-RT^i) 



< 2 . 



(26.0.13) 



Let Vo = (vqjVq) denote a unit vector tangent to C at q. Since we are 

assuming that C intersects the x axis transversely at q we have Vq 0. We 

denote the slope of vg by Aq = 1^) and we denote iterates of q and vq by 

I'^O I 

91 = /(?). M = Df{q)vo, 

92 = f{qi), V 2 = Df{qi)vi, 

q-a = /((?n-l), Vn = £>/( 9 „- 1 )w„- 1 . 



(26.0.14) 
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The steps in the proof of the lemma are as follows. 



Step 1. Estimate the slopes of the iterates of vq under Df and show that 
they are bounded for all n> uq. 

Step 2. The estimates of step 1 are for a vector tangent to C at q. Using 
continuity of tangent vectors with respect to the point of tangency, we 
extend the estimates of step 1 to all tangent vectors in some smaller 
curve, C, contained in /””(C) that passes through We then 

estimate the slopes of an arbitrary vector on C under iteration by 

Df. 

Step 3. Once the estimates for the slopes of iterates of tangent vectors 
to C are obtained we then show that C stretches in the direction of 
TU“(0) n U under iteration by /. 



We begin with the first step. We estimate the evolution of the slope of 
Vq under iteration by Df. We have 



Df{q)vo 



(>^ + fix{x,0) fiy{x,0) \ / 

V 0 M + /2y(S,0)yV 

f {>> + fix{x,0))v^ + fiy{x,0)v 

V ifJ'+ f2y{x,0))v^ 



V 

V 

y " 
0 



X 

0 

y 

0 




,(26.0.15) 



where the zero entry in the matrix arises since / 2 (a^, 0 ) = / 2 a:(^j 0 ) = 0. 
Using (26.0.15) and (26.0.10), we obtain the following estimates 



^ ^ \>^vE + hxVE + flyVol 

\vf\ lMi’o+/2y«ol 
^ \>^vE + flxVE\ , l/lyl 
“ lM^'o +/2 !/«oI \f^ + hyy 

^ |A|K| |1+»I , \fly\ 

~ \^^\\Vo\ \l + ^\ \f^ + f2y\' 

<Ua 1 + fc I fc 

- |/i| 



(26.0.16) 



Henceforth we will not explicitly denote the argnments of the partial 
derivatives fix,fiy,f 2 x,f 2 y as we will always estimate them using the 
uniform estimate in (26.0.10). This will always apply since they will 
always be evaluated on points in U. 

Repeating the calculation in (26.0.15), and the estimates used in (26.0.16) 
gives 
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A 2 = 



< 



kfl ^ \>^vf + flxVf + flyVfl 

\vl\ IK + /2y<l 

Ba ^ + I fc 
ImI 



(26.0.17) 



Repeating these same calculations and estimates, at the n*'' step we obtain 
the following estimate 



A <Ba I fc 



(26.0.18) 



or, 



A„ < 



< 



|A| 1 + fe 

|A| 1 + fe 
\a\l-k 



An -I- 



An + 



k ^V|A|l-ffc 



— fell A_i±fc 

' ^ fi l-k 



(26.0.19) 



Since, by (26.0.11), — >■ 0 as n — >■ 00 we can find an integer no 

such that 

An < 3 for all n > no- (26.0.20) 



This completes step 1. 

In step 2 we begin by considering a smaller neighborhood of the origin, 
but one that is only shrunk in the x direction. Let (5 be a small positive 
real number and let Six denote Ix multiplied or “scaled” by S. Let 

Ul = Six X ly. 



Then we can choose S so small that 

sup|/iy|</ci ( 26 . 0 . 21 ) 

Ui 

where 

( 26 . 0 . 22 ) 

By continuity of tangent vectors, we can find a curve in C, 

containing such that the slope A of any unit vector on this curve 

obeys 
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^no ^ 4, 



(26.0.23) 



see Fig. 26.0.3. 




FIGURE 26.0.3. 



Let p denote any point in C and let v denote a vector tangent to C at 
p. We calculate the iterates of the slope of v. As earlier, we have 



Df{p) 



{ XV- + fl.V- + flyVy\ 

V hxv"" + pyy + f2yvy j 



X \ 

no + 1 1 

y ) ) 
no + 1 / 



(26.0.24) 



which is used to obtain the following estimates 

, ^ + flxV"" + flyV^l 

\f2xV^ + pyy + f2yvy\ 

^ \Xv^ + flxV^\ IflyV^l 

~ \f2xV^ + pyy + f2yvy\ \f2xV^ + pvy + f2yvy\ 

^|A|KI Il + ¥l , fci 

- \p\\vy\\l+ \fi+ f2y + f2x^\ 

^ 1 + fc ^ 

“1^1 1 - fc - kXno \p\ - k - kXno 



(26.0.25) 
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or, using (26.0.23), 



|A| 



1 -I- A: 



ki 



Xno + l < |;x|-5A:- 



(26.0.26) 



Iterating these calculations as above gives 
'|A| 1 + A: 



^riQ+n ^ 



Ann + 






(26.0.27) 



Since — >■ 0 as n — >■ oo, and A„j, is bounded by (26.0.23), we can 

find an integer h such that 



/|A| 1 + fc 
V|/x| 1 - 5k 



Ko < 2 



for n > n. Using this, along with (26.0.22), gives 

Ang-t-n ^ e 



(26.0.28) 



for n > n. Note that this calculation is valid for any tangent vector in 
/"((7). We can take Nq in the statement of the lemma to be no + n. This 
completes step 2. 



In step 3 we compare the norm of a tangent vector to that of its image 
under Df, i.e., we estimate 



/K-kP + 




./ bia;|2 1 


bb|2 



kn-l-ll /^n+1 + 1 

KI V + i ■ 



(26.0.29) 



For n sufficiently large this expression is arbitrarily close to 



An 



0 as n — >■ oo. We also have that 



hii+il 

bill 



since 



'Jn+l\ 

\vl\ 



\f2xv^ + 



hyvll 



yn\ 



— |/2a;An + fJ, + f2y\ > \fJ,\ — 2k > 1 



(26.0.30) 



for n large by (26.0.12) and (26.0.28). Thus we see that the norms of the 
iterates of nonzero tangent vectors are growing by a ratio that approaches 
1^1 — 2fc > 1. Hence, /"(K is stretching in the direction of IU“(0) fl U. 
This, together with (26.0.28), proves the lemma. □ 
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We make several remarks regarding the lambda lemma. 

Remark 1. The lambda lemma is valid in n-dimensions (Palis and de Melo 
[1982], Newhouse [1980]) and even oo-dimensions (Hale and Lin [1986], 
Lerman and Silnikov [1989], Walther [1987]). However, the statement of the 
lemma requires some technical modifications. A continuous time version of 
the lambda lemma can be found in Deng [1989a, b]. 

Remark 2. The phrase e-close implies that tangent vectors on are 
e-close to tangent vectors on 1T“(0) fl U. By our choice of coordinates in 
U, all vectors tangent to W“(0) fl U are parallel to (0, 1). 

Remark 3. It should be clear from step 3 of the proof that the estimates 
involved in proving the lambda lemma give us information on the stretch- 
ing of tangent vectors. In particular, let zq G with (^20,772^) 

a vector tangent to at Zq. It follows that Df^{zo){^zojVzo) = 

(C/"(zo)’ ^/'^(^o)) ^ vector tangent to at f^{zo). Then 

can be made arbitrarily small by taking N large enough, and 

can be made arbitrarily large by taking N large enough; see Figure 26.0.4. 

We now define the transversal map, f^, of Vq into Vi as follows. Let 
D{f'^) denote the domain of and choose p € Vq- We then say that 
p G if there exists an integer n > 0 such that 

r{p) G 



and 

(26.0.31) 

Next we define 

f{p) = rip) e ^1, (26.0.32) 

where n is the smallest integer such that (26.0.32) holds. 



Step 4- The Dynamics Outside ofU. Recall from (26.0.9) that we have 
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/ 

/ ‘ 




FIGURE 26.0.4. 
f{Vi) = Vo. 



Hence, since Vi C U, can be represented in the x — y coordinates as 
follows 








( Mx,y)\ 

\(t)2{x,y) ) ’ 



(x,y)GHi, (26.0.33) 



where y = y — yi, 4>i{x,y) and (f> 2 {x,y) are 0(2) in x and y, a,b,c,d are 
constants, and, from (26.0.7), qo = (xo,0) and qi = (0,j/i). 

Step 5: The Transversal Map of Vq into Vg. Using Steps 3 and 4 we have 
that 

f o f: D{f) CVo^Vo (26.0.34) 

is a transversal map of D{f'^) C Vq into Vq- 
Now we can finally state Moser’s theorem. 



Theorem 26.0.5 (Moser [1973]) For k sufficiently large the map /*o/^ 
has an invariant Cantor set on which it is topologically conjugate to a full 
shift on N symbols. 



Proof: The strategy is to find /x/j-horizontal strips in Vq that are mapped 
homeomorphically onto /i„-vertical strips in Vq with proper behavior of the 
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boundaries such that Assumptions 1 and 3 of Chapter 25 hold. Theorem 
26.0.5 will then follow from Theorem 25.2.1. We remark that we will take 
as the horizontal boundaries of Vq the two segments of the boundary of 
Vq that are “parallel” to W®(0) and as the vertical boundary of Vq the 
remaining two segments of the boundary of Vq. Similarly, the horizontal 
boundary of Vi is taken to be the two segments of the boundary of Vi 
parallel to 1T®(0), and the horizontal boundary of Vi is taken to be the 
remaining two segments of the boundary of Vi; see Figure 26.0.5. 




FIGURE 26.0.5. 



We begin by choosing a set of /x/j-horizontal strips in Vq such that As- 
sumption 1 of Section 25.1 holds. First we make the observation that it 
follows from the lambda lemma that there exists a positive integer Ng such 
that for N > Nq both vertical boundaries of the component of /^(Vg) fl U 
containing f^{qo) intersect both horizontal boundaries of Vi as shown in 
Figure 26.0.6. 

Let Vn = f^{Vo) n Vi denote this set. Then, for Nq sufficiently large, it 
follows by applying the lambda lemma to that /“^(Uat) = Hn is a 
/x/j-horizontal strip stretching across Vq with the vertical boundaries of 
contained in the vertical boundaries of Vq as shown in Figure 26.0.6. 

(Note: since the tangent vectors at each point on the horizontal bound- 
aries of Hn can be made arbitrarily close to the tangent vector of IF® (0) fit/, 
it follows that the horizontal boundaries of are graphs over x.) It should 
be clear from the definition of that H]s[ C D{f^). 



Now we choose a sequence of integers, Nq + ji, Nq + j2, - ■ ■ with 

V^ = fNo+ji^VQ)nVi (26.0.35) 
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FIGURE 26.0.6. 

as described above. The sequence {ji, J 2 , • ‘ ‘ j Jn) ‘ ‘ is chosen such that 
the Vi are disjoint. Then 

f-No-h^y.-)=H,, (26.0.36) 

are a set of disjoint ^^-horizontal strips contained in Vq. It follows by 
applying the lambda lemma to f~^ that, for sufficiently large, is 
arbitrarily close to zero. We choose a finite number of the Hi 



Then o f'^(Hi) = Vi, i = 1, ■ ■ ■ , N appear as in Figure 26.0.7. A consid- 
eration of the manner in which the boundary of Vi maps to the boundary 
of Vo under /* shows that horizontal (resp. vertical) boundaries of the Hi 
map to horizontal (resp. vertical) boundaries of the Vi under /* o 

We now need to argue that the Vi are /i„-vertical strips with 0 < < 

1. This goes as follows. By the lambda lemma, for Nq sufficiently large, the 
vertical boundaries of the F) are arbitrarily close to TT“(0) fl U. Hence, by 
the form of given in Step 4, the vertical boundaries of f^{Vi) = Vi are 
arbitrarily close to the tangent vector to IF“(0) at qq. Therefore, the vertical 
boundaries of the Vj can be represented as graphs over the y variables. 
Moreover, by the remark above, fih can be taken as small as we like (by 
taking Nq large); it follows that we can satisfy 0 < where fiy 

is taken to be twice the absolute value of the slope of the tangent vector 
of VU“(0) at qo- (Note: the reason for taking this choice for will be 
apparent when we define stable sectors and verify Assumption 3.) Hence, 
Assumption 1 holds. 

Next we need to show that Assumption 3 holds. We will show this for 
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FIGURE 26.0.7. 

the unstable sectors and leave it as an exercise for the reader to verify the 
part of Assumption 3 dealing with the stable sectors. First we must define 
the unstable sector bundle. 

Recall from Section 25.2 that 

f o f{H,) n = v,i, 

n {f^ O o (26.0.37) 

with 

H= [j H,j and V = |J Vji, (26.0.38) 

i,j&S i,j&S 

where S = {1, • • • , N} {N > 2) is the index set. We choose Zq = {xq, Do) G 
"H U V; then the unstable sector at zq is denoted by 

5“ = e R" I lUI < (26.0.39) 

and we have the sector bundles 

5^- U 5“ and = \J 5“. 

zq^I-L zq^V 



(26.0.40) 
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We make the important remark that by our choice of a vector parallel 
to the tangent vector of W“(0) at <70 is contained in the interior of see 
Figure 26.0.8. 




FIGURE 26.0.8. 

We must show that 

1) D{fof){S^)cS^ (26.0.41) 

and 

2) h/^o/M.o)l> (^) h^ol; (26.0.42) 

where 0 < /i < 1 — fivfJ-h- 

Before showing that 1) and 2) hold we make the following observations. 
Observation 1. 



D{fof) = DfDf (26.0.43) 

and, from (26.0.33), 

= + S)- 

Now, since (pi{x,y) and 4>2{x,y) are 0(2) in x and y, by choosing U suffi- 
ciently small (j)ix, </>ij 7, 4 > 2 x, and <f> 2 y can be made arbitrarily small compared 
to a, 6, c, and d. 
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Observation 2: Consequences of Transversality. From Step 2, PF'*(0) and 
PF“(0) intersect transversely at qo and q\. We have 

Df\<li)=(l d)> (26.0.45) 

and 

!>/-*(*) = = (26.0.46) 



In our choice of coordinates a vector tangent to W“(0) at < 7 i is parallel to 
(0,1), and a vector tangent to VF^(O) at qo is parallel to (1,0). Hence, if 
1F'*(0) and 1F“(0) intersect transversely at qo and qi, we must have that 



Df{qi) 



a I 0 
c d [l 



is not parallel to 



and 



Df-%qo) 



1 



ad — be 
1 

ad — be 



d -b 
—e a 

d 
—e 



is not parallel to 




It is easy to see that these conditions will be satisfied provided 

dyf 0. 



Note that (b,d) is a vector parallel to the tangent vector to 1F“(0) at qo- 
We will use this later on. 

Now we return to showing that Assumption 3 holds for the unstable 
sector bundle. We must demonstrate the following. 
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1. and 

2- h/'=o/r(zo)l > l.\'nzo[. 0 < < 1 - 



D{f^of^){S^)cS^ 



We choose zq €% and let {^zo^Vzo) G and 

Df{zo)iU,Vzo) = i^r(zo),VfTizo))- (26.0.47) 



Then, using (26.0.43) and (26.0.44), we have 



Df{f{zo))Df{zomzo,Vzo) 



/ (a + 4'ix)^fT(zg) + (6 + 4>iy)VfT{zo) \ 
V (c + 4'2x)^fT(zo) + (c^ + 4>2y)'llfT(zo) ) 



f ^/'‘orizo) \ ^ 
\Vf>^or(zo)J ’ 



(26.0.48) 



where all partial derivatives are evaluated at zq = (xo,yo). We must show 
that 

I^/'°°/^(zq)I _ \i<^+ 4>lx){^fT(zo))/{'nfT(zo)) + {b+ 4’ly)\ 
\Vf>‘ofT(zo)\ \{c+ 4>2x){^fT(zo))/i'nf'r{zo)) + (d+ 4’2y)\ 

G (26.0.49) 

From Remark 3 following the lambda lemma, for Nq sufficiently large, 

l^/^Do)l 

\'nnzo)\ 

can be made arbitrarily small. Also, from Observation 1 above, for V suf- 
ficiently small, 

4^1x1 4ly: 42 x<! ^2y 

can be made arbitrarily small. Hence, using these two results we see that 

I^/'°°/^(zq) I 

I^/*'0/^(20) I 

can be made arbitrarily close to 

M 

Ml’ 
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where, as shown in Observation 2 above, (&, d) is a vector parallel to the 
tangent vector of bb“(0) at qo] see Figure 26.0.9. Since this holds for any 
zq G %, we have D{f^ o C 5y. 

DP(Zo)Sz„ 




FIGURE 26.0.9. 



l^/'“o/^Oo)l ^ < II<1- 



Using (26.0.48) we have 

b/'=o/^(zo)l _ \{c + 4>2x)£,fT(^za) + (t^ + ())2y)t?/r(zo)| 

l%ol l%ol 



Again, as a result of the lambda lemma, for No sufficiently large, \'nf'^(zo)\ 
can be made arbitrarily large, |^/T(zp)| can be made arbitrarily small, and 
by transversality of the intersection of IU“(0) and FF®(0) at <7, d yf 0 (with 
4>iy small compared to d). Thus, (26.0.50) can be made as large as we desire 
by choosing Nq big enough. □ 



The Smale-Birkhoff Homoclinic Theorem 

The Smale-Birkhoff homoclinic theorem is very similar to Moser’s theo- 
rem. We will state the theorem and describe briefly how it differs. The 
assumptions and set-up are the same as for Moser’s theorem. 
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Theorem 26.0.6 (Smale [1963]) There exists an integer n > 1 such 
that /" has an invariant Cantor set on which it is topologically conjugate 
to a full shift on N symbols. 

Proof: We will only give the barest outline in order to show the difference 
between the Smale-Birkhoff homoclinic theorem and Moser’s theorem and 
leave the details as an exercise for the reader. 

Choose a “rectangle,” Vq, containing a homoclinic point and the hyper- 
bolic fixed point as shown in Figure 26.0.10. Then, for n sufficiently large, 
/"(ho) intersects Vq a finite number of times as shown in Figure 26.0.10. 




FIGURE 26.0.10. 




FIGURE 26.0.11. Horizontal strips Hi, ■ ■ ■ ,Ha and their image under /". 

Now, one can find /i/j-horizontal strips in Vq that map over themselves 
in /Xt,-vertical strips such that Assumptions 1 and 3 of Chapter 25 hold; 
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see Figure 26.0.11. The details needed to prove these statements are very 
similar to those needed for the proof of Moser’s theorem, and it will be an 
instructive exercise for the reader to give a rigorous proof. □ 



From the outline of the proof of the Smale-Birkhoff homoclinic theorem, 
one can see how it differs from Moser’s theorem. In both cases the invariant 
Cantor set is constructed near a homoclinic point sufficiently close to the 
hyperbolic fixed point. However, in the Smale-Birkhoff theorem, all points 
leave the Cantor set and return at the same time (i.e., after n iterates of 
/); in Moser’s construction, points leave the Cantor set and may return 
at different times (recall the definition of /^). What are the dynamical 
consequences of the two different constructions? 



26.1 Heteroclinic Cycles 



In this subsection we apply the lambda lemma to show that transverse 
heteroclinic cycles (to be defined shortly) imply the existence of transverse 
homoclinic orbits. Hence, Moser’s theorem or the Smale-Birkhoff homo- 
clinic theorem apply. Our proof will be for the two-dimensional case, and 
it follows the general n-dimensional result that can be found in Palis and 
de Melo [1982]. We begin with a preliminary result. 



Proposition 26.1.1 Let f : — >■ 6e a C’’, r > 1 diffeomorphism, 

and suppose that pi, p 2 , and ps are hyperbolic fixed points. Furthermore, 
suppose lF“(pi) transversely intersects W^{p 2 ) and W^(j> 2 ) transversely 
intersects W‘^{pfi). Then lF“(pi) transversely intersects lF®(p 3 ). 



Proof: Let <73 denote a point of transversal intersection of lF“(p 2 ) and 
W‘^{pz), and consider a curve H 23 G kF“(p 2 ) containing p 2 and qz (see 
Fig. 26.1.1). First, note the following consequence of the persistence of 
transverse intersections. 



The fact that D 23 transversely intersects implies that there 

exists e > 0 such that if £) is a curve close to D 23 then D also has 
a point of transverse intersection with W‘{p 3 ). 

Let Q 2 denote a point of transversal intersection of lF“(pi) and W^{p 2 ), 
and consider a curve D 12 G lF“(pi) containing q 2 {see Fig. 26.1.1). Then it 
follows from the lambda lemma that there exists Nq such that /^°(Hi 2 ) 
contains a curve D 12 that is e-close to H 23 . By the persistence of 
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FIGURE 26.1.1. 

transversal intersections, for e sufficiently small there exists a point of 
transversal intersection of D 12 with W‘^{p 3 ). Since VU“(pi) is invariant 
f^°{Di 2 ) C FU“(pi). Hence, has a point of transverse intersec- 
tion with FU®(p 3 ). □ 

Next we define the notion of a heteroclinic cycle. 

Definition 26.1.2 (Heteroclinic Cycle.) Let f : &e o O'", 

r >1 dijjeomorphism, and suppose that p^^pi, - ■ ■ ,Pn-i,Pn = Po o,re hy- 
perbolic fixed points such that W^{pi) transversely intersects 
i = 0, • • • , n — 1. Then the fixed points, along with their stable and unstable 
manifolds, are said to form a heteroclinic cycle. See Fig. 26.1.2. 

Theorem 26.1.3 Let f : — >■ &e a C’’, r > 1 diffeomorphism, 

and suppose that po,Pi, ■ ■ ■ ,Pn-i,Pn = Po hyperbolic fixed points that 
form a heteroclinic cycle. Then transversely intersects W^{pi), 

i = 0, ■ ■ ■ ,n — 1. 

Proof: This is an immediate consequence of Proposition 26.1.1. □ 



26.2 Exercises 



1. Consider the map 



C -I- 31 ( 5 , 17 ), 
17 !->■ AI?7 + 92 ( 5 , 17 ). 



(5,17) G (7 C R^, 



(26.2.1) 



defined in (26.0.1). Under the transformation given in (26.0.3) 




26.2 Exercises 



633 




FIGURE 26.1.2. Heteroclinic cycles imply homoclinic orbits. 



(x, y) = (5 - V - 



with 

^loc(°) = graph 
^loc(O) = graph 

Show that (26.2.1) takes the form 

X 1-^ \x + fi{x^ y), 

y p-y + h(^,y), 



with 



/i(0, y) = 0, 
f2(x,0) = 0. 



What are the specific forms of /i and /2 in terms of gi and g 2 ? 

2. Consider the region V shown in Figure 26.0.1. Show that V can be chosen so that 

y^O(V’) and f~ ^(Vi) appear as in Figure 26.0.1. In particular, show that for some 
positive integers ko^ki > 0, both f^^{V) and lie in the first quadrant with 

their sides coinciding with the particular pieces of W'^(O) and VF^(O) as shown in 
Figure 26.0.1. 

3. Concerning step 2 of the set-up for the proof of Theorem 26.0.5, prove that if VF^(O) 
and W^(0) intersect transversely at a point q, then they also intersect transversely at 
f^{q) for any integer k. 

4. This exercise is concerned with the choosing of the /j.h.-horizontal strips in the proof 

of Theorem 26.0.5. Consider the sequence of integers, Nq -\- ji , Nq + 1 • • • with 

Ui = /"o+^RVb) n Vi. ( 26 . 2 . 2 ) 



Show that the sequence {ji, j 2 i ■ ■ ■ ■> jn, ■ ■ ■} can be chosen such that the Vi are disjoint. 

5. Following the outline given after Theorem 26.0.6, prove the Smale— BirkhofT homoclinic 
theorem. 
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6. Discuss the dynamical similarities and differences between Moser’s theorem (Theorem 
26.0.5) and the Smale— Birkhoff homoclinic theorem (Theorem 26.0.6). In particular, 
how do orbits differ in the invariant sets constructed in each theorem? 

7. Suppose is (r > 1), having a hyperbolic fixed point at po whose stable 

and unstable manifolds intersect transversely as shown in Figure 26.2.1a. 

We are interested in the dynamics near po- Suppose that in local coordinates {x,y) 
near po the linearization of / has the form 

!)(;)■ ii'' 

SO that orbits near po appear as in Figure 26.2.1b. However, we know that the stable 
and unstable manifolds near po oscillate infinitely often as shown in Figure 26.2.1a. 

Are Figures 26.2.1a and 26.2.1b contradictory? If not, show how orbits in Figure 26.2.1b 
are manifested in Figure 26.2.1a. 

8. Consider a (r > 1) diffeomorphism 

/: -> R^ 

having hyperbolic fixed points at po and pi, respectively. Suppose q G IF^(po) C 
VF^(pi); then q is called a heteroclinic point, and if W^(po) intersects W^(pi) trans- 
versely at q, q is called a transverse heteroclinic point. In order to be more descriptive, 
sometimes q is referred to as being heteroclinic to po and pi; see Figure 26.2.2a. 





FIGURE 26.2.1. 
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(a) Does the existence of a transverse heteroclinic point imply the existence of a 
Cantor set on which some iterate of / is topologically conjugate to a full shift 
on N (N > 2) symbols? 

(b) Additionally, suppose that W^{po) intersects W^{pi) transversely to form a 
heteroclinic cycle as shown in Figure 26.2.2b. Show that in this case one can 
find an invariant Cantor set on which some iterate of / is topologically conjugate 
to a full shift on N {N > 2) symbols. {Hint: mimic the proof of Theorem 26.0.5.) 

(c) Suppose that a branch oiW^ (Po) coincides with a branch oiW^ (Po) > yet (Po) 
intersects VF^(pi) transversely at q\ see Figure 26.2.2c. Does it follow that you 
can find an invariant Cantor set on which some iterate of / is topologically 
conjugate to a full shift on N {N > 2) symbols? 





(b) 




FIGURE 26.2.2. 
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Orbits Homoclinic to 
Hyperbolic Fixed Points in 
Three-Dimensional 
Autonomous Vector Fields 



In this chapter we will study the orbit structure near orbits homoclinic 
to hyperbolic fixed points of three-dimensional autonomous vector fields. 
The term “near” refers to both phase space and parameter space. We 
will see that in some cases Smale horseshoe-type behavior may arise. In 
parametrized systems the creation of the horseshoes may be accompanied 
by cascades of period-doubling and saddle-node bifurcations as described in 
Theorem 32.1.2, or the horseshoes may “explode” into creation at a critical 
parameter value. We will see that the nature of the orbit structure near 
the homoclinic orbits depends mainly on two properties of the vector field: 

1. the nature of the eigenvalues of the linearized vector field at the fixed 
point; 

2. the existence of multiple homoclinic orbits to the same hyperbolic 
fixed point which could be a consequence of symmetries of the vector 
field; 

3. the global geometry of the stable and unstable manifolds (roughly 
speaking, how they “twist” in phase space). 



Regarding Property 1, it should be clear that there are only two possi- 
bilities for the three eigenvalues associated with the linearized vector field. 

1. Saddle Ai, A2, A3 real with Ai, A2 < 0, A3 > 0. 

2. Saddle-focus p ± iuj, A with p < 0, A > 0. 



All other possibilities for hyperbolic fixed points follow from these two 
via time reversal. We will analyze each situation individually, but first we 
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want to describe the general technique of analysis that will apply to both 
cases. 

The study of homoclinic and heteroclinic bifurcations has exploded in the 
past 10 years and it is impossible to do it justice in one book in any detail. 
Our approach will be more elementary so as to develop some intuition about 
the key issues that are involved. We will also give an extensive literature 
survey. 



27.1 The Technique of Analysis 



Consider a three-dimensional autonomous C’’ (r > 2) vector field having 
a hyperbolic fixed point at the origin with a two-dimensional stable mani- 
fold and a one-dimensional unstable manifold such that a homoclinic orbit 
connects the origin to itself (i.e., W“(0) fl W^(0) yf 0) (see Figure 27.1.1 
for the two possibilities according to the nature of the linearized flow near 
the origin). 

The strategy will be to define a two-dimensional cross-section to the 
vector field near the homoclinic orbit, and to construct a map of the cross- 





FIGURE 27.1.1. a) Saddle- focus, b) Saddle with purely real eigenvalues. 
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section into itself from the flow generated by the vector field. This is exactly 
the same idea that was used in Chapter 10 and was used to prove Moser’s 
theorem in Chapter 26. Let us be more precise. 

Consider cross-sections Hq and Hi transverse to the homoclinic orbit 
and located in a “sufficiently small neighborhood of the origin” as shown 
in Figure 27.1.2. 

We construct a Poincare map of Hq into itself 

P: Hq — >■ Hq, 

which will be the composition of two maps, one constructed from the flow 
near the origin 

IIo — >■ Hi 

and the other constructed from the flow defined outside a neighborhood of 
the origin 

Pi: Hi — >• Ho. 



Then we have 



P = Pio Po:no 



Hn 





FIGURE 27.1.2. Poincare map near the homoclinic orbit, a) Saddle-focus, b) 
Saddle with real eigenvalues. 
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see Figure 27.1.2. Thus, the entire construction requires four steps. 



Step 1: Define Hq and Hi. We will do this in each of the cases individu- 
ally. As is typical, the choice of a cross-section on which to define a 
Poincare map requires some knowledge of the geometrical structure of 
the phase space. A clever choice can often simplify the computations 
considerably. 

Step 2: Construction of Pq- For Hq and Hi located sufficiently close to the 
origin, the map of Ho into Hi is “essentially” given by the flow gen- 
erated by the linearized vector held. We put “essentially” in quotes, 
because using the linearized vector held to construct Pq does intro- 
duce an error. However, the error can be made arbitrarily small by 
taking Hq and Hi sufficiently small and close to the origin; see Wig- 
gins [1988] and Bakaleinikov and Silbergleit [1995a], [1995b]. More- 
over, this error is truly negligible in the sense that it does not affect 
our results. Therefore, we will construct Pq from the flow generated 
by the linearized vector held in order to avoid unnecessary technical 
distractions. 

Step 3: Construction of Pi. Let po = kF“(0) IT Hq and pi = 1F“(0) THi. 
Then the time of flight from po to pi is finite, since we are outside of a 
neighborhood of the fixed point. We will assume that, except for the 
origin, the homoclinic orbit is bounded away from all other possible 
fixed points of the vector held. Then, by continuity with respect to 
initial conditions, for Hq sufficiently small, the flow generated by the 
vector held maps Hq onto Hi. This implies that the map Pi is defined 
for Ho sufficiently small. 

Thus, Pq is defined, but how is it computed? Taylor expanding Pi 
about Po gives 



Pi{h) — Po + DPi{pi)h + 0{\h\'^), 

where h represents coordinates on Hi centered at pi. Now for Hi 
sufficiently small the 0{\Wf) term in this expression can be made 
arbitrarily small. Therefore, for Pi: Hi — >■ Hq, we will take 

Pi{h) = Po + DPi{pi)h. 

Of course, this approximation to Pi introduces an error. However, in 
Wiggins [1988] and Bakaleinikov and Silbergleit [1995a,b] it is shown 
that the error is truly negligible in the sense that it does not affect 
our results. In some cases it will be necessary to assume that DPi{pi) 
is a diffeomorphism. 
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Step 4-’ Construction of P = P\ o Pq. With Pq and P\ defined the con- 
struction of P is obvious. 



Let us now make some heuristic remarks. Our analysis will give us in- 
formation on the orbit structure in a sufficiently small neighborhood of 
the homoclinic orbit. The map Pq can be constructed exactly from the 
linearized vector field (since we can “solve” linear, constant coefficient or- 
dinary differential equations). Hence, we can compute how Hq is stretched, 
contracted, and, possibly, folded as it passes near the fixed point. Now P\ 
might appear to present a problem, since we cannot even compute DP\ {pi ) 
without solving for the flow generated by the nonlinear vector field. For- 
tunately, and perhaps surprisingly, it will turn out that we do not need to 
know DPi (pi ) exactly, only that it is compatible with the geometry of the 
homoclinic orbit. This will be made clear in the examples to which we now 
turn. 



27.2 Orbits Homoclinic to a Saddle-Point with 
Purely Real Eigenvalues 



Consider the following 
X = Xix + fi{x,y,z; p.), 

y = \ 2 y + f 2 {x,y,z-,p), (x, y, z, /x) G x x x Mb (27.2.1) 

z = X 3 Z + f 3 {x,y,z; p), 



where the fi are and they vanish at (x,y,z,p) = (0,0, 0,0) and are 
nonlinear in x, y, and z. Hence, (4.8.1) has a fixed point at the origin with 
eigenvalues given by Ai, A2, and A3. We make the following assumptions. 

Assumption 1. Ai,A2 < 0, A3 > 0. 

Assumption 2. At p = 0, (27.2.1) possesses a homoclinic orbit F connecting 
(x, y, z) = (0, 0, 0) to itself. Moreover, we assume that the homoclinic orbit 
breaks as shown in Figure 27.2.1 for /i > 0 and p < 0. 

The following remarks are now in order. 

Remark 1. We assume that the parameter dependence is contained in the 
fi and not in the eigenvalues Ai, A2, and A3. This is mainly for convenience 
and does not affect the generality of our results. 
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Remark 2. In Figure 27.2.1 we drew the homoclinic orbit entering a neigh- 
borhood of the origin along a curve that is tangent to the y-axis at the 
origin. This assumes that A 2 > Ai and that the system is generic. We deal 
with these issues in the exercises. Our results will not change for generic 
systems if Ai > A 2 . 

We will analyze the orbit structure in a neighborhood of F in the standard 
way by computing a Poincare map on an appropriately chosen cross-section. 
We choose two rectangles transverse to the flow, which are defined as follows 

Ho = {{x,y,z) € I |x| < £,?/ = £,0 < 2 ; < e}, 

Hi = G I |x| < £, |y| < £,z = £}, 



for some £ > 0; see Figure 27.2.2. 

Computation of Pg 

The flow linearized at the origin is given by 

xft) = xoe^^*, 

y{t) = yoe^^\ (27.2.3) 

z{t) = zoe^^\ 



and the time of flight from Hq to Hi is given by 




jLt > 0 



/X= 0 



fX <0 



FIGURE 27.2.1. 




642 



27. Orbits Homoclinic to Hyperbolic Fixed Points 



1 , £ 
t = — log—. 

A3 Zo 



(27.2.4) 



Hence, the map 



Ho 



Hi 



is given by (leaving off the subscript O’s) 




,(|y=A. 

\ ^ 



(27.2.5) 



Computation of Pi 

Following the discussion of Step 3 in our general analysis above, we take 
as Pi the following affine map 



( 



X 

y 

€ 



) 







Pi: Hi 



Hn 



/ a b 0\ 
0 0 0 
\c d 0 J 





(27.2.6) 



where a, b, c, d, e, and / are constants. Note from Figure 27.2.1 that we 
have / > 0, so we may rescale the parameter /i so that / = 1. Henceforth, 
we will assume that this has been done. Let us briefly explain the form of 
(27.2.6). On Hq the y coordinate is fixed at y = e. This explains why there 
are only zeros in the middle row of the linear part of (27.2.6). Also, the z 
coordinate of Hi is fixed at z = £. This explains why there are only zeros 
in the third column of the matrix in (27.2.6). 




FIGURE 27.2.2. 
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The Poincare Map P = P\o Pq 

Forming the composition of Pq and P\, we obtain the Poincare map defined 
in a neighborhood of the homoclinic orbit having the following form. 



P = Pi o Pq: IIo — >■ IIo, 



i-i- 



^(f) 

(f) 



A1/A3 







(27.2.7) 



where IIo is chosen sufficiently small so that Pi o Pq is defined. 

We reiterate that the approximate Poincare map (27.2.7) is valid for e: 
sufficiently small and x and z sufficiently small. For e sufficiently small, 
the approximation of Pq by the linearized flow is valid and, for x and z 
sufficiently small, the approximation of Pi by the affine map Pi is valid. 
Note that e, x, and z are independent. 



Calculation of Fixed Points of P 

Now we look for fixed points of the Poincare map (which will correspond 
to periodic orbits of (27.2.1). First some notation; let 

Then the condition for fixed points of (27.2.7) is 

X = Axzl^il/^3 -b -b e^i, (27.2.8) 

z = Cxzl^il/^3 -b -b PL. (27.2.9) 



Solving (27.2.8) for x as a function of z gives 

_ P2;P2|/A3 

^ 1 — z1zI^iI/A3 



(27.2.10) 



We will restrict ourselves to a sufficiently small neighborhood of the 
homoclinic orbit so that z can be taken sufficiently small in order that the 
denominator of (27.2.10) can be taken to be 1. Substituting this expression 
for X into (27.2.9) gives the following condition for fixed points of (27.2.7) 
in terms of z and pL only 



z- pL = -b 



(27.2.11) 




644 



27. Orbits Homoclinic to Hyperbolic Fixed Points 



We will graphically display the solutions of (27.2.11) for /i sufficiently small 
and near zero by graphing the left-hand side of (27.2.11) and the right-hand 
side of (27.2.11) and seeking intersections of the curves. 



First, we want to examine the slope of the right-hand side of (27.2.11) 
at z = 0. This is given by the following expression 



dz 






|Ai-bA2| ^^^HH^_i 



As 



(27.2.12) 



If we assume that Pi is invertible then ad — be yf 0. This implies that 
AD — BC yf 0 so that C and D cannot both be zero. Therefore, at z = 0, 
(27.2.12) takes the values 

oo if I All < A3 or IA2I < A3 

0 if I All > A 3 and IA 2 I > A 3 . 



There are four possible cases, two each for both the infinite-slope and zero- 
slope situations. The differences in these situations depend mainly on global 
effects, i.e., the relative signs of A, B, C, D, e, and /r. We will consider this 
more carefully shortly. Figure 27.2.3 illustrates the graphical solution of 
(27.2.11) in the zero-slope case. The two-slope cases illustrated in Figure 
27.2.3 give the same result, namely, that for /i > 0 a periodic orbit bifur- 
cates from the homoclinic orbit. 

In the infinite-slope case the two possible situations are illustrated in 
Figure 27.2.4. Interestingly, in the infinite-slope case we get two different 
results; namely, in one case we get a periodic orbit for /x < 0, and in the 
other case a periodic orbit for /x > 0. So what is going on? As we will 
shortly see, there is a global effect in this case that our local analysis does 
not detect. Now we want to explain this global effect. 

Let T be a tube beginning and ending on Hq and Hi, respectively, which 
contains F. Then r fl 1F'*(0) is a two-dimensional strip which we denote as 
TZ. Suppose, without twisting TZ, that we join together the two ends of TZ. 
Then there are two possibilities: 1) 1F'*(0) experiences an even number of 
half-twists inside t, in which case, when the ends of TZ are joined together it 
is homeomorphic to a cylinder, or 2) 1F'*(0) experiences an odd number of 
half-twists inside r, in which case, when the ends of TZ are joined together 
it is homeomorphic to a Mobius strip; see Figure 27.2.5. The reader should 
verify this experimentally with a strip of paper. 
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We now want to discuss the dynamical consequences of these two situa- 
tions. First, consider the rectangle 2? C Ho shown in Figure 27.2.6, which 
has its lower horizontal boundary in VF^(O). We want to consider the shape 
of the image of V under Pq- From (27.2.5), Pq is given by 




(27.2.13) 



FIGURE 27.2.3. Graphical solution of (27.2.11) in the zero-slope case. 





FIGURE 27.2.4. Graphical solution of (27.2.11) in the infinite-slope case. 
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where, to avoid confusion, we label the coordinates in IIi by x' and y' . 
Now consider a horizontal line in 2?, i.e., a line with z = constant. From 
(27.2.13) we see that this line is mapped to a line in Hi given by 



y = e 




= constant. 



(27.2.14) 



However, the length of this line is not preserved, since 



X 

X 



© 






0 as z — >■ 0 



(27.2.15) 



because Ai < 0 < A3. Next consider a vertical line in T>, i.e., a line with 
X = constant. From (27.2.13) we see that 




z 



= £ 



. 1 + 



^2 

^3 Z 




(27.2.16) 




FIGURE 27.2.5. 
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Hence, for — A 2 > A 3 , the length of vertical lines is contracted under Pq 
as z — >■ 0 and, for — A 2 < A 3 , the length of vertical lines is expanded 
under Pq as z — >■ 0. Now, (27.2.15) implies that a horizontal line in V on 
the stable manifold of the origin is contracted to a point (i.e., Pq is not 
defined here). From these remarks we thus see that V is mapped to the 
“half-bowtie” shape as shown in Figure 27.2.6. Note that for A 2 > Ai the 
“vertical” boundary of the “half-bowtie” is tangent to the y-axis at the 
origin. If A 2 < Ai, it would be tangent to the x-axis at the origin. Since we 
are assuming A 2 > Ai for the purpose of illustrating the construction and 
geometry of the maps, we show only the first case. 

Under the map Pi the half-bowtie Po(P) is mapped back around F with 
the sharp tip of Pq(P) coming back near F fl Hq. In the case where TZ is 
homeomorphic to a cylinder, Pq(P) twists around an even number of times 
in its journey around F and comes back to Hq lying above IF'*(0). In the 
case where TZ is homeomorphic to a mobius strip, Pq(P) twists around an 
odd number of times in its journey around F and returns to Hq lying below 
IF'*(0); see Figure 27.2.7. 

At this point, we will return to the four different cases that arose in 
locating the bifurcated periodic orbits and see which particular global effect 
occurs. 

Recall from (27.2.11) that the z components of the fixed points were 




(b) 



FIGURE 27.2.6. 
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obtained by solving 



l^l+^2l Pii IM 

z = CBz ^3 + Ce^z ^3 + Dz ^3 + fj,. 



(27.2.17) 



The right-hand side of this equation thus represents the z-component of 
the first return of a point to Hq. Then, at /x = 0, the first return will be 
positive if we have a cylinder (C) and negative if we have a mobius band 
(M). Using this remark, we can go back to the four cases and label them 
as in Figure 27.2.8. 

We now address the question of stability of the bifurcated periodic orbits. 



Stability of the Periodic Orbits 
The derivative of (27.2.7) is given by 





FIGURE 27.2.7. 
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DP = 



As 



As 



i>ii|Ai| i^ii 1 IA 9 I 1^21 1 
Cz^'-^Cxz^-^ + '-^Dz^-^ 
A3 A3 



(27.2.18) 



Stability is determined by considering the nature of the eigenvalues of 
(27.2.18). The eigenvalues of DP are given by 



tr DP 1 

71,2 = ± -\/(trDP)2 - 4det(DP), 



(27.2.19) 



where 



lAol l-^l+-^2l — -^3 

det DP = '-^{AD - BC)z ^3 
A 3 

1^1 1 I Ai I ]Ail_i 
tr DP = Az ^3 + L^(Jxz ^3 
A 3 



A 3 



(27.2.20) 



Slope Zero (IX^I > X3, IX2I >^3) 




Cyllnder(C) 



Moblut Band (M) 



Siopeoo (IX^I < X3 or IX2I < X3) 




Cylinder (C) 




FIGURE 27.2.8. 
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Substituting equation (27.2.10) for x at a fixed point into the expression 
for tr DP gives 



Pni 

tr DP = Az ^3 



|Ai 



I-A.1+A2I T 

■CBz^3 ^ 



IA 2 



■Dz^ 



l^2l 



A3 



i^ii 



(27.2.21) 



Let us note the following important facts. 



For 2 ; sufficiently small 



det DP is 



J a) arbitrarily large for |Ai + A 2 I < A 3 ; 
( b) arbitrarily small for |Ai + A 2 I > A 3 . 



tr DP is 



J a) arbitrarily large for |Ai| < A 3 or IA 2 I < A 3 ; 
(b) arbitrarily small for |Ai| > A 3 and IA 2 I > A 3 . 



Using these facts along with (27.2.19) and (27.2.20) we can conclude the 
following. 



1. For |Ai| > A 3 and IA 2 I > A 3 , both eigenvalues of DP can be made 
arbitrarily small by taking z sufficiently small. 

2. For |Ai + A 2 I > A 3 and |Ai| < A 3 and/or IA 2 I < A 3 , one eigenvalue 
can be made arbitrarily small and the other eigenvalue can be made 
arbitrarily large by taking z sufficiently small. 

3. For |Ai + A 2 I < A 3 , either both eigenvalues can be made arbitrarily 
large, or one eigenvalue can be made arbitrarily large and the other 
arbitrarily small, by taking z sufficiently small. 



We summarize our results in the following theorem. 



Theorem 27.2.1 For /i yf 0 and sufficiently small, a periodic orbit bifur- 
cates from F in (27.2.1). The periodic orbit is a 



i) sink for |Ai| > A 3 and IA 2 I > A 3 ; 

ii) saddle for |Ai + A 2 I > A 3 , |Ai| < A 3 , and/or IA 2 I < A 3 ; 

iii) saddle or source for |Ai + A 2 I < A 3 . 

We remark that the construction of the Poincare map used in the proof 
of Theorem 27.2.1 was for the case A 2 > Ai (see Figure 27.2.1); however. 
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the same result holds for A 2 < Ai and Ai = A 2 . We leave the details to the 
reader in the exercises. 

Next we consider the case of two homoclinic orbits connecting the saddle 
type fixed point to itself and show how under certain conditions chaotic 
dynamics may arise. 



27 . 2 a Two Orbits Homoclinic to a Fixed Point 
Having Real Eigenvalues 



We consider the same system as before; however, we now replace Assump- 
tion 2 with Assumption 2' given below. 

Assumption 2' . Equation (27.2.1) has a pair of orbits, E^, E^, homoclinic 
to (0, 0, 0) at ^ = 0, and E^. and E^ lie in separate branches of the unstable 
manifold of (0, 0, 0). There are thus two possible generic pictures illustrated 
in Figure 27.2.9. 



Note that the coordinate axes in Figure 27.2.9 have been rotated with 
respect to those in Figure 27.2.1. This is merely for artistic convenience. We 
will consider only the configuration of Case a in Figure 27.2.9; however, the 
same analysis (and most of the resulting dynamics) will go through for Case 
b. Our goal will be to establish that the Poincare map constructed near the 
homoclinic orbits contains the chaotic dynamics of the Smale horseshoe or, 
more specifically, that it contains an invariant Cantor set on which it is 
homeomorphic to the full shift on two symbols (see Chapter 24) . 



We begin by constructing the local cross-sections to the vector field near 



(27.2.22) 



the origin. 


We define 
















IT'’ 


= {i.x,y,z) 


G 




\v = 


= e, \x\ < e, 


0 

A 


< 


e}, 


ET^ 


= {i.x,y,z) 


G 




\y = 


= e, \x\ < e, 


VI 

(0 

1 


2; 


A 

0 




= [i.x,y,z) 


G 


]R 3 


\z = 


= e, |a;| < e, 


0 

A 


< 


e}, 


n! 


= {i.x,y,z) 


G 


]R 3 


\z = 


= |x| < 


0 

A 


y 


<e} 



for £ > 0 and small; see Figure 27.2.10 for an illustration of the geometry 
near the origin. 

Now recall the global twisting of the stable manifold of the origin. We 
want to consider the effect of this in our construction of the Poincare map. 
Let Tr (resp. re) be a tube beginning and ending on IIJ (resp. 11^) and IIq 
( resp. IIq) which contains Ei. (resp. E^) (see Figure 27.2.5). Then TrnlE'*(0) 
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y 




FIGURE 27.2.9. 




FIGURE 27.2.10. 
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(resp. Ti n VP^(O)) is a two-dimensional strip, which we denote as TZr (resp. 
TZi). If we join together the two ends of TZr (resp. TZi) without twisting TZr 
(resp. TZi), then TZr (resp. TZi) is homeomorphic to either a cylinder or a 
mobius strip (see Figure 27.2.5). Thus, this global effect gives rise to three 
distinct possibilities. 



1. TZr and TZi are homeomorphic to cylinders. 

2. TZr is homeomorphic to a cylinder and TZi is homeomorphic to a 
mobius strip. 

3. TZr and TZ^ are homeomorphic to mobius strips. 

These three cases manifest themselves in the Poincare map as shown in 
Figure 27.2.11. 

We now want to motivate how we might expect a horseshoe to arise in 
these situations. Consider Case 1. Suppose we vary the parameter /r so 






P (TT^) 



P(<) 

P 



P 



FIGURE 27.2.11. 
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that the homoclinic orbits break, with the result that the images of IIq and 
IIq move in the manner shown in Figure 27.2.12. The question of whether 
or not we would expect such behavior in a one-parameter family of three- 
dimensional vector fields will be addressed shortly. 

From Figure 27.2.12 one can begin to see how we might get horseshoe- 
like dynamics in this system. We can choose /x/j-horizontal strips in IIq and 
IIq, which are mapped over themselves in /x„-vertical strips as /i is varied, 
as shown in Figure 27.2.13. The conditions on the relative magnitudes of 
the eigenvalues at the fixed point will insure the appropriate stretching 
and contracting directions. Note that no horseshoe behavior is possible at 

fj, = 0. 

Of course, many things need to be justified in Figure 27.2.13, namely. 




FIGURE 27.2.12. 




- Vertical Strips 



FIGURE 27.2.13. 
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the stretching and contraction rates and also that the little “half-bowties” 
behave correctly as the homoclinic orbits are broken. However, rather than 
go through the three cases individually, we will settle for studying a specific 
example and refer the reader to Afraimovich, Bykov, and Silnikov [1983] for 
detailed discussions of the general case. However, first we want to discuss 
the role of parameters. 

In a three-dimensional vector field one would expect that varying a pa- 
rameter would result in the destruction of a particular homoclinic orbit. 
In the case of two homoclinic orbits we cannot expect that the behavior 
of both homoclinic orbits can be controlled by a single parameter result- 
ing in the behavior shown in Figure 27.2.11. For this we would need two 
parameters where each parameter could be thought of as “controlling” a 
particular homoclinic orbit. In the language of bifurcation theory this is 
a global codimension-two bifurcation problem. However, if the vector field 
contains a symmetry, e.g., (27.2.1) is invariant under the change of coordi- 
nates (x,y,z) — >■ {—x,y,—z), which represents a 180° rotation about the y 
axis, then the existence of one homoclinic orbit necessitates the existence 
of another so that one parameter controls both. For simplicity, we will treat 
the symmetric case and refer the reader to Afraimovich, Bykov, and Sil- 
nikov [1983] for a discussion of the nonsymmetric cases. The symmetric 
case is of historical interest, since this is precisely the situation that arises 
in the much-studied Lorenz equations; see Sparrow [1982]. 

The case we will consider is characterized by the following properties. 
Assumption 1' . 0 < — A2 < A3 < — Ai, d yf 0. 

Assumption 2' . Equation (27.2.1) is invariant under the coordinate trans- 
formation (x,y,z) — >■ {—x,y,—z), and the homoclinic orbits break for /i 
near zero in the manner shown in Figure 27.2.14. 

Assumption AT insures that the Poincare map has a strongly contracting 
direction and a strongly expanding direction (recall from (27.2.6) that d 
is an entry in the matrix defining Pi, and d yf 0 is a generic condition). 
The reader should recall the discussion of Figure 27.2.6, which explains the 
geometry behind these statements. 

Now, the Poincare map P of Hq U Hg into Hg U Hg consists of two parts 

P,n:H” ^ H” UH^, (27.2.23) 



with Pr given by (27.2.7), and 



p,:n^o^H5un^g, 



(27.2.24) 
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where by the symmetry we have 



(27.2.25) 



Our goal is to show that, for /i < 0, P contains an invariant Cantor set on 
which it is topologically conjugate to the full shift on two symbols. This is 
done in the following theorem. 



Theorem 27.2.2 There exists < 0 such that, for /iq < < 0, P 

possesses an invariant Cantor set on which it is topologically conjugate to 
the full shift on two symbols. 



Proof: The method behind the proof of this theorem is the same as that 
used in the proof of Moser’s theorem (see Chapter 26). In IIqUIIq we locate 
two disjoint /r?i-horizontal strips that are mapped over themselves in two 
/x„-vertical strips so that Assumptions 1 and 3 of Section 25 hold. 

We choose /i < 0 fixed. Then we choose two p.ft,-horizontal strips, one 
in Ilg and one in IIq, where the “horizontal” coordinate is the z axis. We 
choose the horizontal sides of the strips to be parallel to the x-axis so that 
fXh = 0. Then, under the Poincare map P defined in (27.2.23) and (27.2.24), 
since A 3 < — Ai and /i is fixed, we can choose the two /i/j -horizontal strips 




/X > 0 




/X = 0 




/X < 0 



FIGURE 27.2.14. 
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sufficiently close to VP^(O) so that the image of each /x/j-horizontal strip 
intersects both horizontal boundaries of each of the /i?i-horizontal strips as 
shown in Figure 27.2.15. Thus, it follows that Assumption 1 holds. 

Next we must verify that Assumption 3 holds. This follows from a calcu- 
lation very similar to that which we performed to show that Assumption 3 
holds in the proof of Moser’s theorem in Chapter 26. It uses the fact that 
— A 2 < A 3 < — Ai and d yf 0. We leave the details as an exercise for the 
reader. □ 

We left out many of the details of the proof of Theorem 27.2.2. In the 
exercises we outline how one would complete the missing details. 

The dynamical consequences of Theorem 27.2.2 are stunning. For /i > 0, 
there is nothing spectacular associated with the dynamics near the (broken) 
homoclinic orbits. However, for ^ < 0, the horseshoes and their attendant 
chaotic dynamics appear seemingly out of nowhere. This particular type of 
global bifurcation has been called a homoclinic explosion. 



27.2b Observations and Additional References 

We have barely scratched the surface of the possible dynamics associated 
with orbits homoclinic to a fixed point having real eigenvalues in a third- 
order ordinary differential equation. There are several issues which deserve 
a more thorough investigation. 




FIGURE 27.2.15. 
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1. Two Homoclinic Orbits without Symmetry. See Afraimovich, Bykov, 
and Silnikov [1983] and the references therein. 

2. The Existence of Strange Attractors. Horseshoes are chaotic invari- 
ant sets, yet all the orbits in the horseshoes are unstable of saddle 
type. Nevertheless, it should be clear that horseshoes may exhibit a 
striking effect on the dynamics of any system. In particular, they are 
often the chaotic heart of numerically observed strange attractors. 
For work on the “strange attractor problem” associated with orbits 
homoclinic to fixed points having real eigenvalues in a third-order 
ordinary differential equation, see Afraimovich, Bykov, and Silnikov 
[1983] and Shashkov and Shil’nikov [1994]. Most of the work done on 
such systems has been in the context of the Lorenz equations. Refer- 
ences for Lorenz attractors include Sparrow [1982], Guckenheimer and 
Williams [1980], and Williams [1980]. Recently, some breakthroughs 
have been made in proving the existence of strange attractors in such 
equations by Rychlik [1990] and Robinson [1989]. Recently, an elegant 
computer assisted proof of the existence of a strange attractor for the 
Lorenz equations was given by Tucker [1999]; see also Morales et al. 
[1998] and the popular articles of Stewart [2000] and Viana [2000]. 

3. Bifurcations Creating the Horseshoe. In the homoclinic explosion an 
infinite number of periodic orbits of all possible periods are created. 
The question arises concerning precisely how these periodic orbits 
were created and how they are related to each other. This ques- 
tion also has relevance to the strange attractor problem. See Robin- 
son [2000], Gonchenko et al. [1996], [1997], and Silnikov and Turaev 
[1997]. 

In recent years Birman, Williams, and Holmes have been using the 
knot type of a periodic orbit as a bifurcation invariant in order to 
understand the appearance, disappearance, and interrelation of pe- 
riodic orbits in third-order ordinary differential equations. Roughly 
speaking, a periodic orbit in three dimensions can be thought of as 
a knotted closed loop. As system parameters are varied, the periodic 
orbit may never intersect itself due to uniqueness of solutions. Hence, 
the knot type of a periodic orbit cannot change as parameters are 
varied. The knot type is therefore a bifurcation invariant as well as a 
key tool for developing a classification scheme for periodic orbits. For 
references, see Birman and Williams [1983a, b], Holmes [1986], [1987], 
Holmes and Williams [1985], Ghrist et al. [1997], Gilmore [1998], and 
Plumecoq and Lefranc [2000a,b]. 
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We now consider the dynamics near an orbit homoclinic to a fixed point of 
saddle-focus type in a third-order ordinary differential equation. This has 
become known as the Silnikov phenomenon, since it was first studied by 
Silnikov [1965]. 

We consider an equation of the following form 
X = px - ujy + P{x,y,z), 

y = ujx + py + Q{x,y,z), (27.3.1) 

z = Az + R(x, y, z), 

where P, Q, R are and 0(2) at the origin. It should be clear that (0, 0, 0) 
is a fixed point and that the eigenvalues of (27.3.1) linearized about (0, 0, 0) 
are given by p ± ico, A (note that there are no parameters in this problem 
at the moment; we will consider bifurcations of (27.3.1) later). We make 
the following assumptions on the system (27.3.1). 

Assumption 1. Equation (27.3.1) possesses a homoclinic orbit T connecting 
(0, 0, 0) to itself. 

Assumption 2. X > —p > 0. 

Thus, (0,0,0) possesses a two-dimensional stable manifold and a one- 
dimensional unstable manifold which intersect nontransversely; see Figure 
27.3.1. 

In order to determine the nature of the orbit structure near T, we con- 
struct a Poincare map defined near T in the manner described at the be- 




FIGURE 27.3.1. 
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ginning of this section. 

Computation of Pq 

Let Ho be a rectangle lying in the x — z plane, and let Hi be a rectangle 
parallel to the x — y plane at z = e; see Figure 27.3.2. As opposed to the 
case of purely real eigenvalues, fig will require a more detailed description. 
However, in order to do this we need to better understand the dynamics of 
the flow near the origin. 

The flow generated by (27.3.1) linearized about the origin is given by 

x{t) = e^*{xo cos Lot — yo sinwt), 

y{t) = e'’* (xg sin wt + j/o cos wt) , (27.3.2) 

z{t) = Zoe^K 

The time of flight for points starting on Hg to reach Hi is found by solving 

£ = zoe^* (27.3.3) 





FIGURE 27.3.2. 
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(A logf) 

We now consider Ho more carefully. For Ho arbitrarily chosen it is possible 
for points on IIo to intersect IIo many times before reaching IIi. In this 
case, Po would not map Hq diffeomorphically onto Po(no). We want to 
avoid this situation, since the conditions for a map to possess the dynamics 
of the shift map described in Section 25 are given for diffeomorphisms. 
According to (27.3.2), it takes time t = 2Tijuj for a point starting in the 
X — z plane with cc > 0 to return to the x — z plane with cc > 0. Now let 
x = e, 0<z<£be the right-hand boundary of Hq. Then if we choose 
X = 0 < 2 < £ to be the left-hand boundary of Hq, no point 

starting in the interior of Hq returns to Hq before reaching Hi. We take 
this as the definition of Ho: 

Ho= {{x,y,z) GK^|y = 0, < x < £, 0 < z < £}. (27.3.6) 

Hi is chosen large enough to contain Po(no) in its interior. 

We now want to describe the geometry of Po(no). Hi is coordinatized by 
X and y, which we will label as x' , y' to avoid confusion with the coordinates 
of Hq. Then, from (27.3.5), we have 

(x',y')= (^(^)'^'cos(^log^),x(^)"^'sin(^log^)). (27.3.7) 

Polar coordinates on Hi give a clearer picture of the geometry. Let 

r = x''^ + y'^, — = tan 9. 

x' 



(27.3.5) 



Then (27.3.7) becomes 

{r,9) = ,^log^^. (27.3.8) 

Now consider a vertical line in Hq, i.e., a line with x = constant. By (27.3.8) 
it gets mapped into a logarithmic spiral. A horizontal line in Hq, i.e., a line 
with z = constant, gets mapped onto a radial line emanating from (0, 0, e). 
Consider the rectangles 

Rk = {(x,y, z) G |y = 0, ee^ <x<e, 

-2-K(k + l)>. -2T^k\ ^ 

ee < z < £6 }. 



(27.3.9) 




662 



27. Orbits Homoclinic to Hyperbolic Fixed Points 



Then we have 

OO 

Ho = [J 

k=0 



We study the geometry of the image of a rectangle Rk by determining the 
behavior of its horizontal and vertical boundaries under Pq. We denote 
these four line segments as 



h'“ = {{x,y, z) £ \ y = 0, z = ee 



-2nkX 2-rrp 

^ ,ee ^ 



< X < e}, 



0 f , ^ Q , — 27 r(fc + l)A 27 rp 

= {{x,y, z) £ \y = 0, z = ee ,ee^ < a; < e|, 



f O I -27T(fc + l)X -2nkX ^ 

= {(a;, y, z) € K I y = 0, X = £, ee “ < z < ee " } , 

= |(x, y, z) G K I y = 0, X = ee^ , ee < .z < ee 



(27.3.10) 



see Figure 27.3.3. The images of these line segments under Pq are given by 

Po(^“) = -2) G I z = e, 0 = 27rfc, ee < r < ee^^ }, 

Po(h^) = {(r, 0, z) G I z = e, 0 = 27r(/c + 1), ee < r < ee 

Poiv"^) = {{’’’t^jZ) G I z = e, 2i:k < 0 < 2n{k + l),r{9) = ee^ }, 

Po{v^) = {{rjOjZ) G I z = e, 2Trk < 0 < 2n{k + l),r{0) = ee }, 

(27.3.11) 



so that Po{Rk) appears as in Figure 27.3.3. 

The geometry of Figure 27.3.3 should give a strong indication that horse- 
shoes may arise in this system. 




FIGURE 27.3.3. 
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Computation of Pi 



From the discussion at the beginning of this section, we approximate Pi 
by an affine map as follows 



Pi: Hi — >■ fig, 



X 

y 

e 






/a b 0\ 
0 0 0 
\c d 0/ 





(27.3.12) 



where (x, 0,0) = F fl Hg is the intersection of the homoclinic orbit with 
Ilg (note: by our choice of Hg, F intersects Fg only once). We remark that 
the structure of the 3x3 matrix in (27.3.12) comes from the fact that the 
coordinates of Hi are x and y with z = e = constant and the coordinates 
of Hg are x and 2 with y = 0. 



The Poincare Map P = Pi o Pq 
Composing (27.3.5) and (27.3.12) gives 






P = Pi O Pg:IIg -)> Hg, 
[acos(^log^)+6sin(^log^) 

[ccos(^log^)+dsin(^log^) 



J 

(27.3.13) 



where Hg is chosen sufficiently small (by taking e small). Thus, P(Ftg) 
appears as in Figure 27.3.4. 



Our goal is to show that P contains an invariant Cantor set on which it 
is topologically conjugate to a full shift on (at least) two symbols. 

Consider the rectangle Rk shown in Figure 27.3.5. In order to verify the 
proper behavior of horizontal and vertical strips in P^, it will be necessary 
to verify that the inner and outer boundaries of P{Rk) both intersect the 
upper boundary of Rk as shown in Figure 27.3.5 or, in other words, the 
upper horizontal boundary of Rk intersects (at least) two points of the 
inner boundary of P{Rk)- Additionally, it will be useful to know how many 
rectangles above Rk P{Rk) also intersects in this manner. We have the 
following lemma. 
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Lemma 27.3.1 Consider Rk for fixed k sufficiently large. Then the inner 
boundary of P{Rk) intersects the upper horizontal boundary of Ri in (at 
least) two points for i > k/a where 1 < a < —Xjp. Moreover, the preimage 
of the vertical boundaries of P{Rk)C\Ri is contained in the vertical boundary 

ofRk- 



Proof: The 2 : coordinate of the upper horizontal boundary of Ri is given by 



z = ee 




(27.3.14) 



and the point on the inner boundary of Po{Rk) closest to (0,0, e) is given 

by 

47TP 27rfcp 

ruiin = £e " e - . (27.3.15) 

Since Pi is an affine map, the bound on the inner boundary of P{Rk) = 
Pio Pn{Rk) can be expressed as 

47rp 2irkp 

= Kse—e— (27.3.16) 



for some K > 0. The inner boundary of P{Rk) will intersect the upper 
horizontal boundary of Ri in (at least) two points provided 

^ > 1. (27.3.17) 

Using (27.3.14) and (27.3.16), we compute this ratio explicitly and find 

= Ke^ (27.3.18) 



Because Ke^'^^l'^ is a fixed constant, the size of (27.3.18) is controlled by 
the i term. In order to make (27.3.18) larger than one, it is 
sufficient that kpPiX is taken sufficiently large. By Assumption 2 we have 
A -I- p > 0, so for i > kja, 1 < a < —Xjp, kp + iX is positive, and for k 
sufficiently large, (27.3.18) is larger than one. 

We now describe the behavior of the vertical boundaries of Rk- Recall 
Figure 27.3.5a. Under Pq the vertical boundaries of Rk map to the inner and 
outer boundaries of an annulus-like object. Pi is an invertible affine map; 
hence, the inner and outer boundaries of Po{Rk) correspond to the inner 
and outer boundaries of P{Rk) = Pi o Po{Rk). Therefore, the preimage of 
the vertical boundary of P{Rk) H Ri is contained in the vertical boundary 
of Rk- □ 
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Lemma 27.3.1 points out the necessity of Assumption 2, since, if we had 
instead — p > A > 0, then the image of Rk would fall below Rk for k 
sufficiently large, as shown in Figure 27.3.5b. 

We now can state our main theorem. 

Theorem 27.3.2 For k sujficiently large, Rk contains an invariant Can- 
tor set, Ak, on which the Poincare map P is topologically conjugate to a 
full shift on two symbols. 

Proof: The proof is very similar to the proof of both Moser’s theorem 
(see Section 26) and Theorem 27.2.2. In Rk we must find two disjoint 
p/j-horizontal strips that are mapped over themselves in /x„-vertical strips 
on which Assumptions 1 and 3 of Section 25 hold. 

The fact that such /i^-horizontal strips can be found on which Assump- 
tion 1 holds follows from Lemma 27.3.1. The fact that Assumption 3 holds 
follows from a calculation similar to that given in Moser’s theorem and 
Theorem 27.2.2. In the exercises we outline how one fills in the details of 
this proof. □ 

We make several remarks. 

Remark 1. The dynamics of P are often described by the phrase “P has a 
countable infinity of horseshoes.” 

Remark 2. Note from Lemma 27.3.1 that the horseshoes in the different Rk 
can interact. This would lead to different ways of setting up the symbolic 
dynamics. We will deal with this issue in the exercises. 

Remark 3. If one were to break the homoclinic orbit with a perturbation, 
then only a finite number of the Ak would survive. We deal with this issue 
in the exercises. 



27.3a The Bifurcation Analysis of Glendinning 
AND Sparrow 

Now that we have seen how complicated the orbit structure is in the neigh- 
borhood of an orbit homoclinic to a fixed point of saddle-focus type, we 
want to get an understanding of how this situation occurs as the homo- 
clinic orbit is created. In this regard, the analysis given by Glendinning 
and Sparrow [1984] is insightful. 
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Suppose that the homoclinic orbit in (27.3.1) depends on a scalar pa- 
rameter /i in the manner shown in Figure 27.3.6. 



We construct a parameter-dependent Poincare map in the same manner 
as when we discussed the case of a fixed point with all real eigenvalues. 
This map is given by 






fe\p/^r w, e ,.w. £i _ 

a; I - J [a cos — log — h o sm — log - J + e^x + x 
\ z y A z A z~^ 



A 



^ ^(“) [ccos ^ log ^ -I- dsin ^ log -I- j 



, (27.3.19) 



where, from Figure 27.3.6, we have / > 0. We have already seen that 
this map possesses a countable infinity of horseshoes at /x = 0, and we 
know that each horseshoe contains periodic orbits of all periods. To study 
how the horseshoes are formed in this situation as the homoclinic orbit is 
formed is a difficult (and unsolved) problem. We will tackle a more modest 



ti< 0 






FIGURE 27.3.6, 
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problem which will still give us a good idea about some things that are 
happening; namely, we will study the fixed points of the above map. Recall 
that the fixed points correspond to periodic orbits which pass through a 
neighborhood of the origin once before closing up. First we put the map in 
a form which will be easier to work with. The map can be written in the 
form 






^ pcos log ^ + e/x + a; 






(z) ^ <7COS (^^log^ +(/)2) +M 



(27.3.20) 



where we have rescaled /x so that / = 1 (note that / must be positive) . 



Now let 








‘^loge + ^ 1 , 



(i = qe ^ 

CJ 

^2 = ^loge + ti2- 



Then the map takes the form 

( x\ cos(^log z + <j)i) + + x\ 

cos(^ log z + d) 2 ) + ^ /■ 



(27.3.21) 



Now we will study the fixed points of this map and their stability and 
bifurcations. 



Fixed Points 

The fixed points are found by solving 

X = axz^ cos(^log z + ^i) + efj, + x, 

Z = (3xz^ COs(^ log Z + < 1 > 2 ) + F- 



Solving (27.3.22) for x as a function of z gives 

e/x + X 

X —— . 

1 — az^ cos(^ log z + <j)i) 



(27.3.22) 

(27.3.23) 



(27.3.24) 



Substituting (27.3.24) into (27.3.23) gives 
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(z — /x)(l — « 2 :'^cos(^logz-l-<I>i)) = (e^-l-a;)/3z'^cos(^logz-l-<&2)- (27.3.25) 

Solving (27.3.25) gives us the z-component of the fixed point; substituting 
this into (27.3.22) gives us the x-component of the fixed point. In order to 
obtain an idea about the solutions of (27.3.25) we will assume that z is so 
small that 

1 — o;z'^cos(^logz-|- d)i) ~ 1; (27.3.26) 

then the equation of the z component of the fixed point will be 

(z — ^) = (e^-l-x)/3z'^cos(^logz-l- < 1 ) 2 ). (27.3.27) 

We solve (27.3.27) graphically by drawing the graph of both the right- and 
left-hand sides of (27.3.27) and looking for points of intersection. There are 
various cases shown in Figure 27.3.7. 

^ < 1 . In the the case (5 < 1, we have 



Case 1:5 <1 



z-H 




z-n 




z-ii 




11=0 



/i>0 



FIGURE 27.3.7. Graphical construction of the solutions of (27.3.27) for 5 < 1. 
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fj, < 0: finite number of fixed points; 

/i = 0: countable infinity of fixed points; 
fj, > 0: finite number of fixed points. 



^ > 1 . The next case is <5 > 1, i.e., Assumption 2 does not hold. We show 
the results in Figure 27.3.8. In the case 5 > 1, we have 

fj, < 0: there are no fixed points except the one at z = /r = 0 (i.e., the 
homoclinic orbit). 

/X > 0: For z > 0, there is one fixed point for each /x. This can be seen as 
follows: the slope of the wiggly curve is of order z^~^, which is small 
for z small, since 5 > 1. Thus, the z — fx line intersects it only once. 

Case 2 - 6>1 





FIGURE 27.3.8. Graphical construction of the solutions of (27.3.27) for 5 > 1. 
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Again, the fixed points which we have found correspond to periodic orbits 
of the parametrized version of (27.3.1) which pass once through a neighbor- 
hood of zero before closing up. Our knowledge of these fixed points allows 
us to draw the following bifurcation diagrams in Figure 27.3.9. 

The (5 > 1 diagram should be clear; however, the (5 < 1 diagram may 
be confusing. The wiggly curve in the diagram above represents periodic 
orbits. It should be clear from Figure 27.3.9 that periodic orbits are born 
in pairs, and the one with the lower z value has the higher period (since it 
passes closer to the fixed point). We will worry more about the structure 
of this curve as we proceed. 

Stability of the Fixed Points 

The Jacobian of the map is given by 




5>i 




FIGURE 27.3.9. a) J > 1. b) 5 < 1. 
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A C 
D B 



where 

A = cos(^ log z + <&i), 

B = /3xz^~^ [S cos(^ log z + ^ 2 ) - Csin(^logz + < 1 ) 2 )], 
C = axz'^“^[(5cos(^ log z + <&i) — ^ sin(^ log z + 

D = (3z^ cos(^ log z + <I> 2 )- 

The eigenvalues of the matrix are given by 

Ai,2 = ]^[{A + B)±^{A + by - A{AB -CD)Y 



(27.3.28) 



(27.3.29) 



^ > 1 . For 5 > 1, it should be clear that the eigenvalues will be small if z is 
small (since both z^ and z^~^ are small). Hence, for i5 > 1, the one periodic 
orbit existing for /r > 0 is stable for fj, small, and the homoclinic orbit at 
/X = 0 is an attractor. 

The case i5 < 1 is more complicated. 

(5 < 1 . First notice that the determinant of the matrix given by AB — CD 
only contains terms of order z^^~^ , so the map will be 

area-contracting A < (5 < 1, 
area-expanding 0 < <5 < 

for z sufficiently small. 

We would thus expect different results in these two different 6 ranges. 

Now recall that the wiggly curve whose intersection with z — fx gave the 
fixed points was given by 

(e/i -I- x)l3z^ cos(^ log z -I- < 1 ) 2 ) ■ 



Thus, from (27.3.28), a fixed point corresponding to a maximum of this 
curve corresponds to B = 0, and a fixed point corresponding to a zero 
crossing of this curve corresponds to D = 0. We now want to look at the 
stability of fixed points satisfying these conditions. 

D = 0 . In this case Ai = A, X 2 = B. Thus, for z small, Ai is small and A 2 
is always large; hence, the fixed point is a saddle. Note in particular that. 
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for fj, = 0, D is very close to zero; hence all periodic orbits will be saddles 
as expected. 

B = 0 . The eigenvalues are given by 

Xi,2 = 1[A±VA^ + 4CD], 

and both eigenvalues will have large or small modulus depending on 
whether CD is large or small, since 

A^ - can be neglected compared with CD ^ 

A^ can be neglected compared with CD ^ 

Whether or not CD is small depends on whether 0<i5<|ori<(5<l. 
Hence, we have 

stable fixed points for i < <5 < 1. 
unstable fixed points for 0 < i5 < ^. 

Now we want to put everything together for other z values (i.e., for z such 
that B,D ^ 0). 

Consider Figure 27.3.10, which is a blow-up of Figure 27.3.7 for various 
parameter values. In this figure the intersection of the two curves gives us 
the z coordinate of the fixed points. 

Now we describe what happens at each parameter value shown in Figure 
27.3.10. 



Z-/X7 




FIGURE 27.3.10. 
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fj, = At this point we have a tangency, and we know that a saddle-node 
pair will be born in a saddle-node bifurcation. 

fj, = II 5 : At this point we have two fixed points; the one with the lower z 
value has the larger period. Also, the one at the maximum of the 
curve has B = 0; therefore, it is stable for 5 > |, unstable for <5 < |. 
The other fixed point is a saddle. 

/X = ^ 4 : At this point the stable (unstable) fixed point has become a saddle 
since D = 0. Therefore, it must have changed its stability type via a 
period-doubling bifurcation. 

fj, = ^ 3 : At this point B = 0 again; therefore, the saddle has become either 
purely stable or unstable again. This must have occurred via a reverse 
period-doubling bifurcation. 

fj, = ^ 2 - A saddle-node bifurcation occurs. 



Hence, we finally arrive at Figure 27.3.11. 

Next we want to get an idea of the size of the “wiggles” in Figure 27.3.11 
because, if the wiggles are small, that implies that the one-loop periodic 
orbits are only visible for a narrow range of parameters. If the wiggles are 
large, we might expect there to be a greater likelihood of observing the 
periodic orbits. 

Let us denote the parameter values at which the tangent to the curve in 
Figure 27.3.11 is vertical by 

Mo Mi-t-l? ' * * ; f^i+m ' ' ' ^ (27.3.30) 



Period 















Saddle Orbit 

Stable (6>1/2) Orbit 

Unstable (S<1/2) Orbit 
— e— >- Period Doubling 
Bifurcation 




FIGURE 27.3.11. 
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where the /ti alternate in sign. Now recall that the z component of the fixed 
point was given by the solutions to the equations 



z — fx= (e/i -I- x)f3z^ cos(^ log z -I- < 1 ) 2 ). 



(27.3.31) 



Thus, we have 

Zi- Hi = (e^i -I- x)l3zf cos(^ log z* -I- $ 2 ) 
5 

1+1 



Zi+i - Hi+i = {em+i + cos(^ log z^+i -b ^> 2 )- 



From (27.3.32) and (27.3.33), we obtain 

Zj - xPzf cos(g log z^ -I- ^ 2 ) 

1 -b ePzf cos(^ log Zj -b ^> 2 ) ' 



Hi = 



Mi-tl — 



z»+i - xf3zf^-^ cos(ClogZi+i -b d> 2 ) 
1 -b el3zf,-^ cos(^ log Zi+i + $ 2 ) 



Now note that we have 



Cl Cl ^ ^ Zi+I 7T 

^ log Zi+i - ^log Zi « 7T ^ « exp -, 

2:* c 



(27.3.32) 

(27.3.33) 



(27.3.34) 

(27.3.35) 

(27.3.36) 



and we assume that z << 1 so that 

1 + cos(C log Zi(i+i) + $ 2 ) ~ 1. 

Finally, we obtain 

Hi+l _ Zi+i + [X/3C0S(^ log Zi -b ^ 2 )]zt+i 



Ht 



Zi - [xl3 cos(^ log Zi -b ^ 2 )]zf 



(27.3.37) 



(27.3.38) 



Now, in the limit as z — >■ 0, (27.3.38) becomes 

s 



M»+i 

Ht 



Zi+l 



- exp 1 — 



ttS 



(27.3.39) 



Recall that S = —pjX, ^ = —u/X. We thus obtain 



,. pi+l 

iim = — exp — . 

i->oo Hi OJ 



(27.3.40) 



This quantity governs the size of the oscillations we see in Figure 27.3.11. 
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27.3b Double-Pulse Homoclinic Orbits 



Now we will show that, as we break our original homoclinic orbit (the prin- 
cipal homoclinic orbit), other homoclinic orbits of a different nature arise, 
and the Silnikov picture is repeated for these new homoclinic orbits. This 
phenomenon was first noted by Hastings [1982], Evans et al. [1982], Gas- 
pard [1983], and Glendinning and Sparrow [1984]. We follow the argument 
of Gaspard. 

When we break the homoclinic orbit, the unstable manifold intersects 
Hq at the point (e/r + T, /i). Thus, if /r > 0, this point can be used as an 
initial condition for our map. Now, if the z component of the image of this 
point is zero, we will have found a new homoclinic orbit which passes once 
through a neighborhood of the origin before falling back into the origin. 
This condition is given by 

0 = j3{epL + x)pL^ cos(^ log pL + 4 ) 2 ) + pL (27.3.41) 

or 

—pL = j3{epL + x)pL^ cos(^ log p, + 4>2). (27.3.42) 

We find the solutions for this graphically for <5 > 1 and 5 < 1 in the same 
manner as we investigated the equations for the fixed points; see Figure 
27.3.12. 

Thus, for (5 > 1 , the only homoclinic orbit is the principal homoclinic 
orbit which exists at p = 0. 

For (5 < 1 , we get a countable infinity of p values 

pi-i Pi+l-! * * * ) Pi+ni * ‘ ‘ ^ 9, (27.3.43) 



for which these subsidiary or double-pulse homoclinic orbits exist, as shown 
in Figure 27.3.13. 

Note for each of these homoclinic orbits, we can reconstruct our original 
Silnikov picture of a countable infinity of horseshoes. For a reference dealing 
with double-pulse homoclinic orbits for the case of real eigenvalues, see 
Yanagida [1987]. 



27.3c Observations and General Remarks 



We end this section with some final observations and remarks. 
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Remark 1: Comparison Between the Saddle with Real Eigenvalues and the 
Saddle-Focus. Before leaving three dimensions, we want to re-emphasize 
the main differences between the two cases studied. 

Real Eigenvalues. In order to have horseshoes it was necessary to start 
with two homoclinic orbits. Even so, there were no horseshoes near the 
homoclinic orbits until the homoclinic orbits were broken such as might 
happen by varying a parameter. It was necessary to know the global twist- 
ing of orbits around the homoclinic orbits in order to determine how the 
horseshoe was formed. 




6>1 




FIGURE 27.3.12. Graphical construction of the solutions of (27.3.42). 




FIGURE 27.3.13. 
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Complex Eigenvalues. One homoclinic orbit is sufficient for a countable 
infinity of horseshoes whose existence does not require first breaking the 
homoclinic connection. Knowledge of global twisting around the homoclinic 
orbit is unnecessary, since the spiralling associated with the imaginary part 
of the eigenvalues tends to “smear” trajectories uniformly around the ho- 
moclinic orbit. 



There is an extensive amount of work on Silnikov’s phenomenon. Below 
we give some references. 

Remark 2: Strange Attractors. Silnikov-type attractors have not attracted 
the great amount of attention that has been given to Lorenz attractors. 
The topology of the spiralling associated with the imaginary parts of the 
eigenvalues makes the Silnikov problem more difficult, but see Gonchenko 
et al. [1997] and Homburg [2002]. 

Remark 3: Creation of the Horseshoes and Bifurcation Analysis. We have 
given part of the bifurcation analysis of Glendinning and Sparrow [1984]. 
Their paper also contains some interesting numerical work and conjectures. 
The reader should also consult Gaspard, Kapral, and Nicolis [1984] and 
Gonchenko et al. [1997]. 

Remark 4-’ N onhyperbolic Fixed Points. See Deng [1990], Deng [1992], Lyu- 
bimov and Byelousova [1993], Hirschberg and Knobloch [1993], Ghamp- 
neys, Harterich, and Sandstede [1996], and Ghampneys and Rodriguez-Luis 
[1999]. 

Remark 5: Applications. The Silnikov phenomenon arises in a variety of 
applications. See, for example, Arneodo, Goullet, and Tresser [1981a,b], 
[1985], Arneodo, Goullet, Spiegel, and Tresser [1985], Arneodo, Goullet, 
and Spiegel [1982], Gaspard and Nicolis [1983], Hastings [1982], Pikovskii, 
Rabinovich, and Trakhtengerts [1979], Rabinovich [1978], Rabinovich and 
Fabrikant [1979], Roux, Rossi, Bachelart, and Vidal [1981], Vyshkind and 
Rabinovich [1976], Arneodo et al. [1993], and Wang [1993]. 

Remark 6: The Ceneral Technique for Analyzing the Orbit Structure Near 
Homoclinic Orbits. The reader should note the similarities between the 
analyses of the orbit structure near orbits homoclinic to 1) hyperbolic pe- 
riodic points of two-dimensional diffeomorphisms (or, equivalently, hyper- 
bolic periodic orbits of autonomous, three-dimensional vector fields); 2) hy- 
perbolic fixed points of three-dimensional autonomous vector fields having 
purely real eigenvalues; and 3) hyperbolic fixed points of three-dimensional 
autonomous vector fields having a pair of complex conjugate eigenvalues. In 
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all three cases a return map was constructed in a neighborhood of a point 
along the homoclinic orbit that consisted of the composition of two maps. 
One map described the dynamics near the hyperbolic invariant set (i.e., pe- 
riodic orbit or fixed point) and the other map described the dynamics near 
the homoclinic orbit outside of a neighborhood of the hyperbolic invariant 
set. Due to the fact that the fixed point is hyperbolic, the first map is well 
approximated from the linearization of the dynamical system (either map 
or vector field) about the hyperbolic invariant set (either periodic orbit 
or fixed point). The second map, if we restrict ourselves to a sufficiently 
small neighborhood of the homoclinic orbit, is well approximated by an 
affine map. In all cases, in order to show that chaotic invariant sets (more 
precisely, an invarant Cantor set on which the dynamics are topologically 
conjugate to a full shift on N symbols) are present near the homoclinic 
orbit, we find N /r^-horizontal strips that are mapped over themselves in 
/i„-vertical strips so that Assumptions 1 and 3 of Chapter 25 hold. The 
nature of the map near the hyperbolic invariant set is responsible for the 
verification of Assumption 3 (i.e., expansion and contraction in the appro- 
priate directions) . The nature of the map outside of the hyperbolic invariant 
set is responsible for the verification of Assumption 1. However, note that 
in the case of transversal orbits homoclinic to hyperbolic periodic orbits of 
n-dimensional diffeomorphisms, we did not need any special assumptions 
on the eigenvalues of the periodic orbit or the nature of the intersection of 
the stable and unstable manifolds (hyperbolicity and transversality were 
sufficient). This is very different from the situation of orbits homoclinic to 
hyperbolic fixed points of autonomous vector fields, where special assump- 
tions on the relative magnitudes of the eigenvalues at the fixed point and 
the nature of the (nontransversal) homoclinic orbit (s) were necessary. 

Remark 1: Higher Dimensional Results. For generalizations to orbits ho- 
moclinic to hyperbolic fixed points of n-dimensional (n > 4) autonomous 
vector fields, the reader should consult Turaev and Silnikov [1987], Ovsyan- 
nikov and Silnikov [1987], Deng [1989a], [1989b], [1993], Chow, Deng, and 
Fiedler [1990], Ovsyannikov and Silnikov [1992], Arnold et al. [1994], and 
Homburg [1996], Glendinning [1997], Ashwin [1997], Ragazzo [1997], Laing 
and Glendinning [1997], Fowler [1990], Chernyshev [1985], and Glendinning 
and Laing [1996]. 

Remark 8: Numerical Methods. See Champneys, Kuznetsov, and Sandst- 
ede [1996] for a review of numerical methods for computing homoclinic 
orbits. See also Beyn and Kleinkauf [1997], Liu et al. [1997], and Doedel 
and Friedman [1989]. 

Remark 9: N-Pulse Homoclinic Orbits. In recent years there has been a 
great deal of work on N-pulse homoclinic orbits or “multi-bump” orbits. See 
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Champneys [1994], Zelati and Nolasco [1999], Camassa et al. [1998], Bolotin 
and MacKay [1997], Rabinowitz [1997], Soto Trevino and Kaper [1996], 
Kaper and Kovacic [1996], Haller and Wiggins [1995], and Glendinning 
[1989]. 



Remark 10: Global Center Manifold Techniques. Recently generalizations 
of the local center manifold techniques to a global setting valid in the 
neighborhood of a homoclinic orbit have been developed. See Sandstede 
[1993], Homburg [1996], Shaskov and Turaev [1999], and Chow et al. [2000]. 

Remark 10: Heteroclinic Cycles. See Wiggins [1988] and Tresser [1984] for 
some general results in Devaney [1976] discusses some heteroclinic cy- 
cles in Hamiltonian systems. Heteroclinic cycles frequently arise in applica- 
tions. For example, they appear to be the mechanism giving rise to “burst- 
ing” in a model for the interaction of eddies in the boundary layer of fluid 
flow near a wall, see Aubry, Holmes, and Lumley [1988] and Guckenheimer 
and Holmes [1988]. See also Tian and Du [2000], Koon et al. [2000], Hom- 
burg [2000], Ashwin and Field [1999], Chossat et al. [1999], Woods and 
Champneys [1999], Chow et al [1999], Sun and Kooij [1998], Qi and Jing 
[1998], Han [1998], Ashwin and Chossat [1998], Belykh and Bykov [1998], 
Zimmermann and Natiello [1998], Chernyshev [1997], Krupa [1997], Hou 
and Golubitsky [1997], Chossat et al. [1997], Maxwell [1997], Zhu [ 1996], 
Lauterbach et al. [1996], Worfolk [1996], Blazquez and Tuma [1996], Bat- 
teli [1994], Krupa and Melbourne [1995], Lauterbach and Roberts [1992], 
McCord and Mischaikow [1992], Field and Swift [1991], Feng [1991], Chos- 
sat and Armbruster [1991], Campbell and Holmes [1991], Armbruster and 
Chossatt [ 1991], and Deng [1991]. 

Remark 11: Homoclinic Orbits in Hamiltonian and Reversible Systems. 
Early work on homoclinic orbits in Hamiltonian systems can be found in 
Devaney [1976], [1978], Holmes [1980], and Wiggins [1988]. There has been 
a great deal of additional work in these areas over the past 10 years. See 
Champneys and Harterich [2000], Zhang [2000], Bolle and Buffoni [1999], 
Arioli and Szulkin [1999], Ding and Willem [1999], Ding and Girardi [1999], 
Koltsova and Lerman [1998], Ding [1998], Koltsova and Lerman [1996], 
Koltsova and Lerman [1995], Ragazzo [1997], Mielke et al. [1992], Lerman 
[1991], Buffoni [1993], Champneys and Toland [1993], Lerman [1989], and 
Turaev and Silnikov [1989] . For some results for reversible systems see 
Afendikov and Mielke [1999], looss [1997], and Chirchill and Rod [1986]. 
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27.4 Exercises 



1. Recall the discussion in Section 27.2 and specifically the vector field (27.2.1). Would 
there be any qualitative changes in the results of these sections if the eigenvalues 
depended on the parameter //? How would this situation best be handled? 

2. Show that generically for A 2 > Ai, the homoclinic orbit in (27.2.1) for ^ = 0 is tangent 
to the y axis in the x — y plane at the origin. 

Construct a nongeneric example where this does not occur and explain why your 
example is not generic. {Hint: find an appropriate symmetry.) 

3. In Equation (27.2.10) we took 



1 

1 _ l/>'3 



1 



as 



0 . 



Show that if instead we take 



1 

1 _ 






our results are not affected for z sufficiently small. 

4. From Section 27.2, consider the case A 2 < Ai. Show that in this case the homoclinic 
orbit of (27.2.1) is tangent to the ai-axis at the origin in the x — y plane. Construct a 
Poincare map near the homoclinic orbit following Section 27.2 and show that Theorem 
27.2.1 still holds. In describing the “half-bowtie” shape of Po(Ho) in Hi, compare it 
with the case Ai < A 2 . 

5. From Section 27.2, consider the case A 2 = Ai. Describe the geometry of the return of 
the homoclinic orbit to the origin. Construct a Poincare map following Section 27.2 
and show that Theorem 27.2.1 still holds. In describing the “half-bowtie” shape of 
Po(no) ni, compare it with the cases Ai > A 2 and A 2 < Ai. 

6. Argue that if (27.2.1) possesses only one homoclinic orbit, then the Poincare map 
defined near the homoclinic orbit cannot possess an invariant Cantor set on which it 
is topologically conjugate to a full shift on N {N > 2) symbols. 

7. Recall the discussion in Section 27.2a. In Assumption 1^, discuss the necessity and 
geometry behind the requirement d ^ 0. 

8. Work out all of the details in the proof of Theorem 27.2.2. {Hint: mimic the proof of 
Moser’s theorem in Section 26.) 

9. Recall the discussion in Section 27.2a. Suppose we instead chose configuration b) in 
Figure 27.2.9 with Assumption l' and Assumption 2' still holding. Is Theorem 27.2.2 
still valid in this case? If so, what modifications must be made in order to carry out 
the proof? 

10. Work out all of the details in the proof of Theorem 27.3.2. {Hint: mimic the proof of 
Moser’s theorem in Section 26.) 

11. In Theorem 27.3.2 we proved the existence of an invariant Cantor set Afc C Rk such 
that the Poincare map restricted to Afc was topologically conjugate to a full shift on 
two symbols. This was true for all k sufficiently large. Show that Theorem 27.3.2 can be 
modified (in particular, the choice of ^^.-horizontal and //t,-vertical strips) so that Hq 
contains an invariant Cantor set on which the Poincare map is topologically conjugate 
to a full shift on N symbols with N arbitrarily large. {Hint: use Lemma 27.3.1 and see 
Wiggins [1988] if you need help.) 

Is there a difference in the dynamics of this invariant set and the dynamics in Ufc>fcQ 
constructed in Theorem 27.3.2? 
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12. Recall the construction in Theorem 27.3.2. Suppose the Poincare map P is perturbed 
(as might occur in a one-parameter family). Show that, for sufficiently small pertur- 
bations, an infinite number of the are destroyed yet a finite number survive. Does 
this contradict the fact that horseshoes are structurally stable? 

13. Recall the discussion in Section 27.3. Suppose (27.3.1) is invariant under the coordinate 
transformation 

(x, y, z) ->■ (-X, -y, -z) 
with Assumptions 1 and 3 still holding. 

a) Show that (27.3.1) must possess two orbits homoclinic to the origin. Draw the 
two homoclinic orbits in the phase space. Denote the homoclinic orbits by Pq 
and Pi . 

b) Construct a Poincare map in a neighborhood of PoUPiU{(0, 0, 0)} and show that 
the map has an invariant Cantor set on which the dynamics are topologically 
conjugate to a full shift on two symbols. 

c) If we denote the two symbols by 0 and 1, show that the motion in phase space 
is such that a ‘ 0 ’ corresponds to a trajectory following close to Pq and a ‘ 1 ’ 
corresponds to a trajectory following close to Pi. Hence, give a geometrical 
description of the manifestation of chaos in phase space. (See Wiggins [1988] if 
you need help.) 

14. Consider our description of the bifurcation analysis of Glendinning and Sparrow. Jus- 
tify the curves shown in Figure 27.3.9. 

15. Consider the sequence of bifurcations discussed in Figure 27.3.10. Explain why a 
period-doubling bifurcation occurs after and a reverse period-doubling bifurcation 
occurs before /^ 3 . 

16. Suppose we reverse the direction of time in (27.3.1), i.e., we let 

t -t, 

with Assumptions 1 and 2 still holding. Describe the dynamics near the homoclinic 
orbit. 

17. Consider the vector field (27.3.1). Suppose that Assumption 1 holds but Assumption 
2 is replaced by the following. 

Assumption 2. —p > A > 0 . 

Describe the dynamics near the homoclinic orbit. (See Holmes [1980] for help.) 

18. Consider a (r as large as necessary) autonomous vector field in R^. We denote 
coordinates in R^ hy x — y — z. Suppose the vector field has two hyperbolic fixed 
points, Pi and p 2 , respectively, in the x — y plane. 

Local Assumption: The vector field linearized at pi has the form 

X — Aix, 

y = A2P, 
z — A32, 



with Ai > 0 , A 3 < A 2 < 0 . 

The vector field linearized at p 2 has the form 



X — px — toy, 
y = ujx py, 
z = As, 



with p < 0, A > 0, cj 0. 

Global Assumption: pi and p 2 are connected by a heteroclinic orbit, denoted Fi 2 that 
lies in the x — y plane. 

P 2 and Pi are connected by a heteroclinic orbit, F 2 i, that lies outside the plane; see 
Figure 27.4.1. 

Thus, Fi 2 U F 2 i U {pi} U {P 2 } form a heteroclinic cycle. The goal is to construct a 
Poincare map, P, in a neighborhood of the heteroclinic cycle and prove the following 
theorem. 
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Theorem 27.4.1 (Tresser) P possesses a countable number of horseshoes provided 



p\2 

aa7 



< 1 . 



{Hint: P will be constructed from the composition of four maps. Define cross sections 
noi, rill, ni 2 , and Ilo 2 appropriately; see Figure 27.4.1. rio,i and Hu should be 
sufficiently close to pi, and rii 2 and rio 2 should be chosen sufficiently close to p 2 - If 
the coordinates on these cross-sections are chosen appropriately, then we can derive 
maps as follows 



Pqi: IIoi 



Hi, 




P02- rio2 IIi2, 



/ / \ 
^2 ^ COS ^ log ^ 



p/\ 



sin y log : 



Pl2- 



X2 

V2 

€ 







i>2 0 \ 
d2 0 
0 0 / 




F21 




FIGURE 27.4.1. 
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Pll- Hu — > IIo2, 

\Z2/ \0/ \0 Cl diJ \zi/ 

where xi — yi — z\ denote coordinates near pi, and — V 2 — ^2 denote coordinates 
near p 2 - These maps are approximations (see the discussion at the beginning of this 
chapter; discuss their validity and specify all steps in their derivation. 

Then the Poincare map near the heteroclinic cycle is defined as 
P = Pll o Pqi o P12 o P02: Ho 2 no2- 



The rest of the proof is very much the same as Theorem 27.3.2. (See Wiggins [1988] 
for additional help.) 

19. Consider a (r as large as necessary) autonomous vector field in R^. We denote 
coordinates in R^ hy x — y — z. Suppose the vector field has two hyperbolic fixed 
points, denoted pi and p 2 , respectively, which lie in the x — y plane. 

Local Assumption: The vector field linearized at pi has the form 

X — pix — uJiy, 
y — ujix + piy, 
z — Xiz, 



with 



Ai > 0, pi < 0 and oji ^ 0. 



The vector field linearized at p 2 has the form 



X — P 2 X — u)2y, 
y — LJ 2 X + p 2 y, 
z — A 22 , 



with A 2 < 0, p 2 > 0, and u )2 ^ 0. 

Global Assumption: There exists a trajectory Pi 2 in the x — y plane connecting pi to 
P2- 

There exists a trajectory P 2 i connecting p 2 to pi. 

See figure 27.4.2 for an illustration of the geometry. Pi 2 and P 2 i are examples of 
heteroclinic orbits, i.e., an orbit that is biasymptotic to two different fixed points. 
Pi 2 U P 21 U {pi} U {P 2 } is said to form a heteroclinic cycle. 




FIGURE 27.4.2. 
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a) Define a Poincare map in the neighborhood of the heteroclinic cycle and deter- 
mine if there are conditions on the eigenvalues (i.e., p, p, A, and A) such that the 
map possesses an invariant Cantor set on which the dynamics are topologically 
conjugate to a full shift on N {N > 2) symbols. 

b) Consider the case 



|P2| = |pi|. 
IA2I = |Ai|, 
11^21 = 



Can horseshoes exist in this case? What is the relevance of this case to Case III 
of the truncated (hence symmetric) three-dimensional normal form discussed in 
Section 33.2? 



20. Recall the motivational example for deriving Poincare maps near homoclinic orbits 
described in Chapter 10. We now describe the analog of that example for heteroclinic 
orbits. 

Consider a (r as large as necessary) two-parameter family of vector fields in the 
plane having a hyperbolic fixed points pi and p 2 , respectively. 

Local Assumption: The vector field linearized at pi is given by 



^ n /O n 

^ ai < 0, Pi > 0, 

VI = 0iVi, 

and the vector field linearized at p 2 is given by 

02 > 0 , 02 < 0 , 

where cti, /3j, i — 1,2 are constants. 



X2 — 

V 2 = P2V2, 





FIGURE 27.4.3. 
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Global Assumption. pi and p2 are connected by a heteroclinic cycle. We denote the 
heteroclinic orbit going from pi to p2 in positive time by Fi2 and the heteroclinic orbit 
going from p2 to p\ in positive time by F21. 

The heteroclinic cycle depends on the parameters as follows. For p. = {pi,p2)i let Af 
be a neighborhood of zero in the pi, p2 plane; then we assume the following. 



1 ) Fi 2 exists for all p G {(/J-i , M2) | Mi = 0 } Pi A/” = Afi . 

2 ) F21 exists for all p £ {(^1 , ^2) | M2 — 0 } Pi A/^ = A/2- 

Furthermore, we assume that F12 and F21 break “transversely” as shown in Figure 
27 . 4 . 3 . 

Construct a two-parameter family of Poincare maps near the heteroclinic cycle and 
study the bifurcation and stability of periodic orbits. 
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Melnikov’s Method for 
Homoclinic Orbits in 
Two-Dimensional, 
Time-Periodic Vector Fields 



We have seen that transverse homoclinic orbits to hyperbolic periodic 
points of two-dimensional maps gives rise to chaotic dynamics in the sense 
of Theorems 26.0.5 and 26.0.6. We will now develop a perturbation method 
originally due to Melnikov [1963] for proving the existence of transverse ho- 
moclinic orbits to hyperbolic periodic orbits in a class of two-dimensional, 
time-periodic vector fields; then by considering a Poincare map, Theorems 
26.0.5 and 26.0.6 can be applied to conclude that the system possesses 
chaotic dynamics. 



28.1 The General Theory 



We consider the following class of systems: 

dH , ^ ^ 

x= -^[x,y) + £gi{x,y,t,e), 
dy 

Q ti , , , , , , ^ 

y = -^{x,y) + £g2{x,y,t,£), 



or, in vector form, 

q = JDH{q) + £g{q,t,£), 

where q = (x,y), DH = (|f , ||), g = ( 51 , 52 ), and 



(28.1.1) 

(28.1.2) 



J = 



0 1 

-1 or 



We assume that (28.1.1) is sufficiently differentiable (C’’, r > 2 will do) on 
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the region of interest. Most importantly, we also assume that g is periodic 
in t with period T = 2tt jui. 



We referred to (28.1.1) with £ = 0 as the unperturbed system 



• 1 

X = 

dy 

dH 

y = -j^{x,y), 



(28.1.3) 



or, in vector form. 



q=JDH{q), 



(28.1.4) 



and we had the following assumptions on the structure of the phase space 
of the unperturbed system (see Figure 28.1.1). 



Assumption 1. The unperturbed system possesses a hyperbolic fixed point, 
Po, connected to itself by a homoclinic orbit (?o(i) = (a^o(f)i 2/o(^))- 

Assumption 2. Let Fp^ = {q g | g = qo{t),t G M} U {po} = W^^(po) FI 
lF“(po) U {po}. The interior of Fp^ is filled with a continuous family of 
periodic orbits q°‘{t) with period T“, a G (—1,0). We assume that 
lim„_>o 9 “(t) = qo{t), and limc_s.o T“ = oo. 



y 

X 



'Po 




FIGURE 28.1.1. 

The subharmonic Melnikov theory enabled us to understand how the pe- 
riodic orbits < 7 “(t) were affected by the perturbation; now we will develop a 
technique to see how the homoclinic orbit, Fp^, is so affected. Geometrically, 
the homoclinic Melnikov method is a bit different from the subharmonic 
Melnikov method. However, there is an important relationship between the 
two as a — >■ 0 (i.e., as the periodic orbits limit on the homoclinic orbit) that 
we want to point out later on in this section. We remark that it is possi- 
ble to develop the homoclinic Melnikov method for a more general class of 
two-dimensional, time-periodic systems than (28.1.1); in particular, we do 
not have to assume that the unperturbed system is Hamiltonian. We will 
deal with these generalizations in the exercises. 
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Our development of the homoclinic Melnikov method will consist of sev- 
eral steps, which we briefly describe. 



Step 1 . Develop a parametrization of the homoclinic “manifold” of the un- 
perturbed system. 

Step 2. Develop a measure of the “splitting” of the manifolds for the per- 
turbed system using the unperturbed “homoclinic coordinates.” 

Step 3. Derive the Melnikov function and show how it is related to the 
distance between the manifolds. 



Before beginning with Step 1 we want to rewrite (28.1.1) as an au- 
tonomous three-dimensional system (cf. Chapter 7) as follows 



• dH. . 

X = -^{x,y} + egi{x,y,(t),e}, 
dy 

y = ~^{x,y) + eg 2 {x,y,(j),e), {x,y,(j)) G x x 



or, in vector form, 

q = JDH{q) + eg{g,(j); e), 

^ = u>. (28.1.6) 



The unperturbed system is obtained from (28.1.6) by setting e = 0, i.e., 

q = JDH{q), 

0 = w. (28.1.7) 



We will see that this apparently trivial trick offers several geometrical ad- 
vantages. In particular, the perturbed system is of a very different character 
than the unperturbed system, and this trick forces us to treat them on a 
more equal footing. Also, the relationship between the splitting of the man- 
ifolds “in time” and the splitting of the manifolds on a particular Poincare 
section and how this is manifested by the Melnikov function will be more 
apparent. 



Step 1: Phase Space Geometry of the Unperturbed Vector Field: A Paramet- 
rization of the Homoclinic Manifold. When viewed in the three-dimensional 
phase space x S'^, the hyperbolic flxed point po of the q component of 
the unperturbed system (28.1.7) becomes a periodic orbit 
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lit) = {po,Ht) = + (28.1.8) 

We denote the two-dimensional stable and unstable manifolds of 7 (f) by 
and W“( 7 (t)), respectively. Because of Assumption 1 above, 
and W“( 7 (t)) coincide along a two-dimensional homoclinic man- 
ifold. We denote this homoclinic manifold by see Figure 28.1.2. We 
remark that the structure of this figure should not be surprising; it reflects 
the fact that the unperturbed phase space is independent of time {4>). 







identify 

([ 1=0 



FIGURE 28.1.2. The homoclinic manifold, F.y. The lines on T.y represent a typical 
trajectory. 

Our goal is to determine how F.y “breaks up” under the influence of the 
perturbation. We now want to describe what we mean by this statement, 
which will serve to motivate the following discussion. 

The homoclinic manifold F.y is formed by the coincidence of two two- 
dimensional surfaces, a branch of kU®( 7 (t)) and a branch of lF“( 7 (t)). In 
three dimensions, one would not expect two two-dimensional surfaces to 
coincide in this manner but, rather, one would expect them to intersect 
in one-dimensional curves as shown in Figure 28.1.3. (Note: as mentioned 
in Chapter 12, if two invariant manifolds of a vector field intersect, they 
must intersect along (at least) a one-dimensional trajectory of the vector 
held if we have uniqueness of solutions; we will explore this in more detail 
later.) Figure 28.1.3 illustrates what we mean by the term “break up” of 
F.y. Now we want to analytically quantify Figure 28.1.3. In order to do this 
we will develop a measurement of the deviation of the perturbed stable 
and unstable manifolds of 7 (f) from F.^. This will consist of measuring 
the distance between the perturbed stable and unstable manifolds along 
the direction normal to F.^. Evidently, this measurement will vary from 
point-to-point on F.y so we first need to describe a parametrization of F-,, . 




28.1 The General Theory 691 




FIGURE 28.1.3. 

Parametrization ofT^: Homoclinic Coordinates. Every point on E..^ can be 
represented by 

('Zo(— ^o)j ^o) G r.y (28.1.9) 

for to G 00 G (0,27t]. The interpretation of to is the time of flight from 
the point qo{—to) to the point qo{0) along the unperturbed homoclinic 
trajectory qo(t). Since the time of flight from qo{—to) to (?o(0) is unique, 
the map 

(^O:0o)' ^ (<Zo(~^o); 0o) (28.1.10) 

is one-to-one so that for a given (to,0o) G {qo{—to),(j)o) corresponds 

to a unique point on T.y (see Exercise 1). Hence, we have 

i "7 = {(<?, 0) G X I g = qo(-to),to G K0 0 = 0o G (0,27 t]}. (28.1.11) 

The geometrical meaning of the parameters to and 0o should be clear from 
Figure 28.1.2. 

At each point p = (<?o(~^o)) 0o) G T.^ we construct a vector, tt^, normal 
to r.^ that is defined as follows 

TTp = {^^-^{xo{-to),yo{-to)),-^{xo{-to),yo{-to)),0] (28.1.12) 

or, in vector form, 

TTp = {DH{qo{-to)),0). (28.1.13) 

Thus, varying to and 0o serves to move Tip to every point on see Figure 
28.1.4. We make the important remark that at each point p G Fj, W^(j(t)) 
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and iy"( 7 (t)) intersect transversely at p. Finally, when considering the 
behavior of F.^ near p under perturbation, we will be interested only in the 
points on Wp that are 0{e) close to p. This will be further clarified in Step 

2 . 





FIGURE 28.1.4. Homoclinic coordinates. 



Step 2: Phase Space Geometry of the Perturbed Vector Field: “The Split- 
ting of the Manifolds. ” We now turn our attention to describing how F..,, 
is affected by the perturbation. However, first we need some preliminary 
results concerning the persistence of 7 (f) along with its stable and unstable 
manifolds. 



Proposition 28.1.1 Fore sufficiently small, the periodic orbit ^(t) of the 
unperturbed vector field (28.1.7) persists as a periodic orbit, 7 e(t) = 7 (f) + 
0{s), of the perturbed vector field (28.1.6) having the same stability type 
as 7 (f) with js{t) depending on e in a C’’ manner. Moreover, 
and bFj“c( 7 e(^)) s-dose to Wf^^i^ft)) and WK^{j(t)), respectively. 



Proof: Using the idea of a Poincare map and appealing to the stable and un- 
stable manifold theorem for maps, proof of this theorem is an easy exercise 
that we leave for the reader (see Exercise 2). □ 



The global stable and unstable manifolds of 7 e(t) can be obtained from 
the local stable and unstable manifolds of 7 e(t) by time evolution as follows. 
Let denote the flow generated by (28.1.6) (note: do not confuse the 
notation for the flow, with the angle (j>). Then we define the global 

stable and unstable manifolds of Je{t) as follows 
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W"*(7.W)= U'^‘Woc(7.W), 

t<0 

W^{le{t)) = U UW^oAleit)). (28.1.14) 

i>0 



If we restrict ourselves to compact sets in x 5”^ containing iy®( 7 e(t)) and 
W^“(7e(^))) then W^®( 7 e(t)) and M^“( 7 £(t)) are C’’ functions of £ on these 
compact sets. This follows from the fact that is a C’’ diffeomorphism 
that is also C’’ in e (see Theorem 7.3.1). Our analysis of the splitting of 
the manifolds will be restricted to an 0{e) neighborhood of T.^. 

Let us describe Proposition 28.1.1 more geometrically. The content of the 
proposition is that, for some £q small, we can find a neighborhood Af{so) 
in X 5”^ containing 7 (f) with the distance from 7 (f) to the boundary 
of Af{so) being O{eo)- Moreover, for 0 < £ < £q, 7 e(t) is also contained 
in A/’(£q) with lP®( 7 (t)) n A/’(£q) = hFj®j,( 7 (t)) and lP“( 7 (t)) nA/’(£o) = 
W^ioc( 7 (i)) C’’ £-close to lT^( 7 e(t))nA/'(£o) = and lP"( 7 e(t))n 

Af{eo) = respectively. We can choose Af{eo) to be a solid torus 

as follows 

■^(£ 0 ) = {(?,</>) G I \q-Po\ < C£o,^ G (0,27 t]}, (28.1.15) 



where C is some positive constant; see Figure 28.1.5. 




(j) = 2n 



identify 



0 = 0 



FIGURE 28.1.5. 

In some of our geometrical arguments we will be comparing individual 
trajectories of the unperturbed vector field with trajectories of the per- 
turbed vector field. For this it will often be easier to consider the projection 
of these trajectories into the g-plane or a plane parallel to the g-plane. Let 
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us show how this is done. Consider the following cross-section of the phase 
space (don’t think about Poincare maps yet) 

= (28.1.16) 



It should be clear that is parallel to the g-plane and coincides with the 
g-plane for 0o = 0; see Figure 28.1.6. Note that 

7(f) n = Po (28.1.17) 

and 

r^ns-^“ = {qeR^\q = qo{t),teR} =Fp„. (28.1.18) 




FIGURE 28.1.6. 



In particular, (28.1.17) and (28.1.18) are independent of <j)o', this simply 
reflects the fact that the unperturbed vector field is autonomous. Now let 
{q{t) , (j){t)) and {qe{t),4>{t)) be trajectories of the unperturbed and per- 
turbed vector fields, respectively. Then the projections of these trajectories 
onto are given by 

(28.1.19) 

and 

(9e(i),0o). (28.1.20) 

We remark that qe{t) actually depends on <j)o (as opposed to q{t)), since the 
g-component of the perturbed vector field (28.1.6) depends on cj), i.e., the 
perturbed vector field is nonautonomous. Therefore, (28.1.20) could be a 
very complicated curve in , possibly intersecting itself many times. This 
tends to obscure much of the dynamical content of the trajectory. We will 
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remedy this situation later on when we consider a Poincare map constructed 
from the flow generated by the perturbed vector held; see Figures 28.1.7 and 
28.1.8 for an illustration of the geometry behind the projections (28.1.19) 
and (28.1.20). 




(c) 



FIGURE 28.1.7. 

We are now at the point where we can define the splitting of VU^( 7 e(t)) 
and FF“( 7 e(t)). Choose any point p G F.^. Then lF®( 7 (t)) and FF“( 7 (t)) 
intersect Tip transversely at p. Hence, by the persistence of transversal in- 
tersections and the fact that FF^( 7 e(t)) and are C’’ in e, for e 

sufficiently small FF®( 7 e(t)) and VF“( 7 e(t)) intersect TTp transversely in the 
points pI and respectively. It is therefore natural to define the distance 
between W‘‘{'^g{t)) and IF“(7e(t)) at the point p, denoted d{p,s), to be 

d{p,e) = \p^ - pI\] (28.1.21) 



see Figure 28.1.9. We will find it convenient in the next step to redefine 
(28.1.21) in an equivalent, but slightly less natural manner as follows 



, {p^-pt)-{DH{qo{-to)),0) 

^ \\DH{qo{-tom 



(28.1.22) 
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where denotes the vector scalar product and 







(?o(-<o)T+(^ 



{qo{-to))) 



2 



Because and are chosen to lie on the vector {DH{qo{—to)),0), it 
should be clear that the magnitude of (28.1.22) is equal to the magnitude 
of (28.1.21). However, (28.1.22) is a signed measure of the distance and 
reflects the relative orientations of Vb^( 7 e(t)) and near p; see 

Figure 28.1.10. Note that since and lie on TTp, we can write 







9o(do)+ 

0(E) 



^r^,(t)o,e)<o 

oto 



|^(to,^o-e)>o 



FIGURE 28.1.10. 

P^ = {q:,M (28.1.23) 



and 



Pt = (9|>0o), 



(28.1.24) 



i.e., and pi have the same </>o coordinate. Thus, (28.1.22) is the same as 

DH{qo{-to)) ■ (C - ql) 



(j)o,s) — 



\\DH{qo{-to))\\ 



(28.1.25) 
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where we are now denoting d{p,s) by d{to,4>o,£^), since every point p 
can be uniquely represented by the parameters (to, 4>o), to G R, </>o G (0, 27 t], 
according to the parametrization p = (qo{—to), (j>o) described in Step 1. 

Before deriving a computable approximation to (28.1.25) in Step 3, we 
want to address a technical issue involving the choice of and p^. Certainly 
by transversality and C’’ dependence on e, for e sufficiently small, bb®( 7 e(t)) 
and bb“(7e(t)) intersect Wp. However, these manifolds may intersect in 
more than one point (indeed, an infinite number of points is possible), as 
shown in Figure 28.1.11. The question then arises as to which points 
and p® are chosen so as to define (28.1.25). We first give a definition. 




FIGURE 28.1.11. 



Definition 28.1.2 Let G fl TTp and p“ j G W^{'fs{t)) fl 7Tp, 

i G I, where I is some index set. Let ^{t) , 4>{t)) G lF®(7e(f)) and 
{q'^ ^{t) , 4>{t)) G W“(^e(t)) denote orbits of the perturbed vector field 
(2L1.6) satisfying ((?U(0), ^(0)) = p^^ and (( 7 “i( 0 ), ^(0)) = pf ^, respec- 
tively. Then we have the following (see Figure 28.1.12). 



1. For some i = i G I we say thatp‘)j is the point in lF®( 7 e(t)) flTTp that 
is closest to 7 e(t) in terms of positive time of flight along lU®( 7 e(t)) 
if for all t > 0, (g® .(t), ((jq) n 7Tp = 0. 

2. For some i = i G I we say thatp'^j is the point in lF“( 7 £(t)) flTTp that 
is closest to js(t) w terms of negative time of flight along W'^f-^flt)) 
if, for all t < 0, (g“j(t), t^o) n 7Tp = 0. 



We make the following remarks regarding this definition. 
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FIGURE 28.1.12. 

Remark 1. For p fixed, we are interested only in the points in W‘^{'^e{t))C\TTp 
and n TTp that are 0(e) close to p. This is because our methods 

are perturbative. 

Remark 2. For the unperturbed system, the orbit through p leaves rip in 
positive and negative time and enters a neighborhood of the hyperbolic 
orbit without ever returning to tt^. The orbits through tt^ described in 
Definition 28.1.2 are the perturbed orbits that most closely behave in this 
manner. 

Remark 3. The points p® ^ and are unique. This will follow from the 
proof of Lemma 28.1.3. 



The points p| and p“ used in defining (28.1.25) are chosen to be closest 
to 7 e(t) in the sense of positive time of flight along and nega- 

tive time of flight along FF“( 7 e(t)), respectively, as described in Definition 
28.1.2. Still, the question remains as to why this choice. The consequences 
of the following lemma will answer this question. 



Lemma 28.1.3 Let p®^ (resp. p“^) be a point on lF®( 7 £(t)) fl (resp. 
lF“(7£(t))n7Tp) that is not closest to 7 e(t) in the sense of Definition 28.1.2, 
and letM{eo) denote the neighborhood of j{t) and 7 e(t) described follow- 
ing Proposition 28.1.1. Let ^(t) , (j){t)) (resp. ^(f) , (p{t))) be a trajec- 
tory in lF®( 7 e(t)) (resp. VF"( 7 e(t))) satisfying (g® .(0), ^(0)) = P®j (resp. 
(g“^(0), (/>(0)) = p“j). Then, for e sufficiently small, before {q^ j{t) , <po) , 
t > 0, (resp. {q'^ j(t) , (po) , t < 0) can intersect tt^ (as it must by Definition 
28.1.2), it must pass through M {eq) (see Figure 28.1.13). 
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FIGURE 28.1.13. 

Proof: We give the argument for trajectories in VU^( 7 £(t)); the argument 
for trajectories in VU“( 7 e(t)) will follow immediately by considering the 
time-reversed vector field. 

First we consider the unperturbed vector field. Consider any point 
{qoyM on W®( 7 (t)) n A/'(eo); see Figure 28.1.14. Let (gg(t),^(t)) G 
satisfy (< 70 ( 0)5 </'(0)) = (<Zo 5 ^o)- Then there exists a, finite time, 
—00 < < 0, such that (< 7 o(T’'*), </>(T^)) G W''^{^(t)) fl A/’(eo). In other 

words, T® is the time that it takes a trajectory leaving Af{eo) in W^{j{t)) 
to reenter A/’(eo); see Figure 28.1.14. 




FIGURE 28.1.14. 

We now want to compare trajectories in IF® ( 7 (f)) with trajectories in 
lF®( 7 e(t)). Choose points 

(<7g,<(.o)GlFi®,(7(t))nAr(£o) 
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and 

((?|,0o)GM/i*oc(7.W)nAr(£o), 

and consider trajectories 



and 



satisfying 



and 



(g^(0),</>(0)) = ((?o,</'o) 



see Figure 28.1.14. Then 



- {q^o{t),(j){t)) \ = O{eo) 



(28.1.26) 



for 0 < t < oo and, by Gronwall’s inequality (see Hale [1980]), 

|(g|(t),((.(t)) - (<?o*W,<('WI = 0{s) (28.1.27) 



for < t < 0. Therefore, a trajectory in TT®( 7 e(t)) leaving Af{eo) in 
negative time must follow 0(e) close to a trajectory in W^{j{t)) until it 
reenters Af{so) (since we only have a finite time estimate outside of A/’(eo))- 

Hence, this argument shows that (gg(t), </>(t)) and {q^{t),(j){t)) remain 
£-close for < t < oo, i.e., until (q^{t) , (j){t)) enters Af{so) under the neg- 
ative time flow. However, this argument does not rule out the fact that 
(g|(t), </>(t)) can develop “kinks” and therefore re-intersect Wp (while re- 
maining £-close to (qQ{t) , (j){t))) , as shown in Figure 28.1.15. 

This does not happen since tangent vectors of (q^{t), (p(t)) and 
(gg(t), </>(t)) are 0{e) close for < t < oo. This can be seen as follows; we 
have just shown that on < t < oo we have 

{qt{t),^{t)) = (g^(t)-f 0(£),<(.(t)), (28.1.28) 

and a vector tangent to {q^{t), (j){t)) is given by 

ql = JDH(ql) + eg{ql,<l,it)), 

^ = LO. (28.1.29) 



Substituting (28.1.28) into the right-hand side of (28.1.29) and Taylor ex- 
panding about £ = 0 gives 
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FIGURE 28.1.15. 
ql = JDH{qf;) + 0{e), 

^ = (28.1.30) 

Now, a vector tangent to {qQ{t) , (j>{t)) is given by 

ro = JDH{ql), 

(t) = uj. (28.1.31) 

Clearly, (28.1.30) and (28.1.31) are 0{e) close on T® < t < oo so that the 
situation shown in Figure 28.1.15 cannot occur on this time interval for e 
sufficiently small. 

Now let j G n 7Tp be a point that is not closest to 7 e(t) in 

terms of positive time of flight as defined in Definition 28.1.2. Let (9%(t), 
(j){t)) G lF®( 7 £(t)) satisfy (g® ^(0), </>(0)) = Then, by Definition 28.1.2, 
for some t > 0, (g® ^(t), </>o) G TT®( 7 e(t)) flTTp. Hence, by the argument given 
above, for e sufficiently small, somewhere in 0 < t < t, (g® |(t), ^(t)) must 
have entered Af{eo)- □ 

We make the following remarks regarding Lemma 28.1.3. 

Remark 1. The reader should note that T® = T®(eo). That is why it was 
necessary to consider a fixed neighborhood containing 7 (f) and 7 e(t)- 

Remark 2. From the proof of Lemma 28.1.3 it follows that the points closest 
to 7 e(t) in the sense described in Definition 28.1.2 are unique. We leave the 
details to Exercise 4. 
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Remark 3. Let pi = {ql,(j)o) G VF®(7e(t)) fl tt^, and let {ql{t),(j){t)) £ 
W^('^e(i)) satisfy (g|(0), (/)(0)) = (ql,4>o). Then if p® is the point closest to 
7e(t) in the sense of Definition 28.1.2, it follows from the proof of Lemma 
28.1.3 that 



\ql{t)-qo{t-to)\ = 0{s), tG[0,oo), (28.1.32) 

\ql(t) - qo{t -to)\ = 0{e), tG[0,oo). (28.1.33) 

A similar statement can be made for points in bT“(7e(t)) fl TTp and solu- 
tions in lT“(7e(t)). In deriving the Melnikov function in the next step we 
will need to approximate perturbed solutions in and IT“(7e(t)) 

by unperturbed solutions in ID® (7(f)) and IT“(7(t)) for semi-infinite time 
intervals with 0(s) accuracy. This is why the Melnikov function will only 
detect points on IT®(7e(t)) fl tt^ fl kT“(7e(t)) that are closest to 7e(t) in the 
sense of Definition 28.1.2. 



Step 3: Derivation of the Melnikov Function. Taylor expanding (28.1.25) 
about e = 0 gives 

dd 

d{to,4'o,s) = d{to,4>o,0) + s — {to,4>o,0) +0(e'^), (28.1.34) 



where 

d(to,0o,O) = 0 



and] 



M 

ds 



{to, (fo, 0) 



DH{qo{-to))-{^U^o-^U=o) 

\\DH{qo{-to))\\ 



(28.1.35) 



(28.1.36) 



The Melnikov function is defined to be 



M{to, 4>o) = DH{qo{—to)) ■ 



dqf 

de 



le=0 




(28.1.37) 



Now, since 

DH{qo{-to)) = [^(^o{-to)),^{qo{-to)?j 



is not zero on qo{—to), for to finite, we see that 
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M{tQ, (Pq) — 0 =k 



dd 

de 



{to, 4>o) — 0 - 



(28.1.38) 



Therefore, up to a nonzero normalization factor (||£)i?((3'o(— to))ID) the Mel- 
nikov function is the lowest order nonzero term in the Taylor expansion for 
the distance between hT^(7e(t)) and hT“(7e(t)) at the point p. 

We now want to derive an expression for M(to, ^o) that can be computed 
without needing to know the solution of the perturbed vector field. We 
do this by utilizing Melnikov’s original trick. We define a time-dependent 
Melnikov function using the flow generated by both the unperturbed vector 
field and the perturbed vector field. We have to be careful here since we 
do not have any a priori knowledge of arbitrary orbits generated by the 
perturbed vector field. However, the persistence and differentiability of 7(f), 
and W“(7(t)) as described in Proposition 28.1.1 are all that we 
need, since Definition 28.1.2 and Lemma 28.1.3 allow us to characterize the 
orbits of interest for determining the splitting of kT®(7e(t)) and 
We then derive an ordinary differential equation which the time-dependent 
Melnikov function must satisfy. The ordinary differential equation turns 
out to be first order and linear; hence it is trivially solvable. The solution 
evaluated at the appropriate time will yield the Melnikov function. 



We begin by defining the time-dependent Melnikov function as follows 



M(t; = DH{qo{t-to)) ■ 



dQeit) 



de 



s—0 



e=oJ ' 



(28.1.39) 



We want to take some care in describing precisely what we mean by 
(28.1.39). We denote orbits in W'*(7e(t)) and TT“(7e(t)) by q^it) and <7“(t), 
respectively. Then in (28.1.39) the expressions 

dq^jt) 

de 

and 

dqljt) 

de 

are simply the derivatives with respect to £ (evaluated at £ = 0) of g“(t) 
and g|(t), respectively, where <7“(t) and q^{t) satisfy 

<Z“(0) = q^, 

qm = ql- 



(28.1.41) 



(28.1.40) 



(28.1.42) 

(28.1.43) 
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The expression qo{t — to) denotes the unperturbed homoclinic orbit. Thus, 
we see that (28.1.39) is a bit unusual; part of it, {DH{qo{t — to))), evolves in 
time under the dynamics of the unperturbed vector field with the remaining 
part 




£=0 



dqljt) \ 



evolving in time under the dynamics of the perturbed vector field. It should 
be obvious that the relationship between the time-dependent Melnikov 
function and the Melnikov function is given by 

M{Q-toAo) = M{to,(i>o)- (28.1.44) 



Next we turn to deriving an ordinary differential equation that 
M{t;to,4>o) must satisfy. The expressions we derive will be a bit cumber- 
some, so for the sake of a more compact notation we define 



de 



£=0 



= W. 



dqm 



de 



£=0 



9iW- 



Then (28.1.39) can be rewritten as 

M{t- to, M = DH{qo{t - to)) ■ {q\{t) - q{{t)). (28.1.45) 



We want to introduce a further definition to compactify the notation as 
follows 

M{t- to, M = A“(t) - A%t), (28.1.46) 

where 

A“’^(t) = DH{qo{t - to)) ■ qnt). (28.1.47) 

Differentiating (28.1.47) with respect to t gives 

|(A“-*(t)) = (^f^{DH{qo{t - to)))) • qnt) 

+ DH{qo{t - to)) ■ (28.1.48) 



The term ^{qi’^{t)) in (28.1.48) needs some explanation. Recall from above 
that we have defined 
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U.S / ,\ 

Qi (t) 



de 



e=0 



and solves 

= JDH{q^^%t)) + eg{q:’^{t),^{t),e), (28.1.49) 



where (j){t) = ujt + (j)o. Since q^'^{t) is C’’ in £ and t (see Theorem 7.3.1), 
we can differentiate (28.1.49) with respect to e and interchange the order 
of the e and t differentiations to obtain 



dt V de 



JD^H{qo{t-to)) 

£=0(t) 

+ g{qo{t-to),Ht),0) (28.1.50) 



or 

^qi’"{t) = JD'^H{qo{t - to))qi’"{t) + g{qo{t - to),(j>{t),0). (28.1.51) 



Equation (28.1.51) is referred to as the first variational equation. We remark 
that qiit) solves (28.1.51) for t G (— oo,0], and ql{i) solves (4.5.51) for 
t £ (0, oo]; see the remarks following Lemma 28.1.3. Substituting (28.1.51) 
into (28.1.48) gives 

|(A“'*(t)) = ^j^{DH{q,(t - to)))) • qr{t) 

+ DH{qo{t - to)) ■ JD^H{qo{t - to))qf’"{t) 

+ DH{qo{t - to)) ■ g{qo{t - to), fit), 0). (28.1.52) 



Now a wonderful thing happens. 

Lemma 28.1.4 

(^P^(<7o(i - fo)))) • qT"it) 

+ DHiqoit - to)) ■ JD^Hiqoit - to))qf'\t) = 0. 



Proof: First note that 
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^{DH{qo{t - to))) = D‘^H{qo(t - to))qo{t ~ to) 

= [D^H{qo{t - to))) {JDH{qo{t - to))) ■ 

(28.1.53) 



Let Then we have 



{D'^H){JDH)-q'('‘' = 




dxdy 

, r d^H dH 



U.S 

Vi 



dx^ dy dxdy dx 
d'^H dH d^H dH] 



_ dxdy dy dy'^ dx 



(28.1.54) 



and 



DH ■ (JD^H)q'^’^ = |j 



dH 



m 

dy 



u,s 



U.S 

Vi 




d'^H dH 



dxdy dx 
d'^H dH 



dxdy dy 



(28.1.55) 



where we have left out the argument qo{t — to) = {xo{t — to),yo{t — to)) 
for the sake of a less cumbersome notation. Adding (28.1.54) and (28.1.55) 
gives the result. □ 



Therefore, using Lemma 28.1.4, (28.1.48) becomes 

^(A“’'*(t)) = DH{qo{t - to)) ■ g{qo{t - to), </>(t), 0). (28.1.56) 

at 

Integrating A“(t) and A®(t) individually from — r to 0 and 0 to r (r > 0), 
respectively, gives 

/.O 

A“(0)-A“(-t) = / DH{qo{t-to))-g{qo{t-to),u;t + <l>o,0)dt (28.1.57) 



and 
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A^(r)-A^(0)= [ DH{qo{t-to))-g{qo{t-to),ujt + (l>o,0)dt (28.1.58) 
Jo 



where we have substituted = ut + (j)Q into the integrand. Adding 
(28.1.57) and (28.1.58) and referring to (28.1.44) and (28.1.46) gives 



M{to, M = M(0, to, M = A“(0) - A^(0) 

= J DH{qo{t-to)) ■ g{qo{t-to),ujt + <j)o,0)dt 

+A®(r) - A“(-t). (28.1.59) 



We now want to consider the limit of (28.1.59) as r — >■ oo. 

Lemma 28.1.5 



lim A'*(r) = lim A“(-r) = 0. 

T— >-oo r—^oo 



Proof: Recall from (28.1.47) that 

A^’%t) = DH{qo{t-to))-qnt)- 

Now, as t — >■ oo (resp. — oo), DH{qo{t — to)) goes to zero exponentially fast, 
since qoft — to) approaches a hyperbolic fixed point. Also, as t — >■ oo (resp. 
— oo), q{{t) (resp. < 7 “(t)) is bounded (see Exercise 6). Hence, A®(t) (resp. 
A“(— r)) goes to zero as r — >■ oo. □ 



Lemma 28.1.6 The improper integral 

/ OO 

DH{qo{t-to)) ■ g{qo{t - to), wt + </)o, 0)dt 

-OO 



converges absolutely. 



Proof: This result follows from the fact that g{qo{t — to),ojt + 0o,O) is 
bounded for all t and DP[{qo{t — to)) goes to zero exponentially fast as 
t — >■ ±oo. □ 



Hence, combining Lemma 28.1.5 and Lemma 28.1.6, (28.1.59) becomes 

/ OO 

DH{qo{t - to)) ■ g{qo{t - to),uit + 4>o, Q)dt. (28.1.60) 

-OO 
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Before giving the main theorem we want to point out an interesting 
property of the Melnikov function. If we make the transformation 

t — >■ t + toj 



then (28.1.60) becomes 

/ OO 

DH{qo{t)) ■ g{qo{t),ujt + ujto + 4>o,0)dt. (28.1.61) 

-OO 

Recall that g{q, ■, 0) is periodic, which implies that M{to, ^o) is periodic in 
to with period 27 t/w and periodic in <f>Q with period 27 t. The geometry of this 
will be explained shortly. However, it should be clear from (28.1.61) that 
varying to smd varying 0o have the same effect. Moreover, from (28.1.61) 
and the periodicity of g{q, •, 0), it follows that 

DM , , , IdM , , , 

(28.1.62) 

O(po UJ oto 



hence, = 0 if and only if = 0. In Theorem 28.1.7 we will need to 
have yf 0 or 0. However, from (28.1.62), if one is nonzero, then 

so is the other; hence, we will state the theorem in terms of yf 0. 



Theorem 28.1.7 Suppose we have a point (to,^o) = (toi'Po) such that 



i) M{to,(j>Q)=0 and 

ii) 



dM 

dto 



_ - 7^ 0- 

(*o,<to) 



Then, for e sufficiently small, lR'*( 7 e(t)) and lR“( 7 e(t)) intersect trans- 
versely at {qo{—tQ)-\-0{e),4>Q). Moreover, if M(to,4>o) yf 0 for all {to,4>o) € 
Ri X S'!, then lT®(7e(t)) n W"(je(t)) = 0- 



Proof: Recall from (28.1.34), (28.1.36), and (28.1.37) that we have 

^(to’'^°’^) =^iiM(t°(5!))ii 

Note that if we define 

d{to, 4>o, e) = ed{to, (j>o, e), (28.1.64) 



where 
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J/j- j, _ M{to,4>o) ^ 



then 



d{to, 00 , e) = 0 ^ d{to, 0o, s) = 0. 
Therefore, we will work with d(to,0o,£)- 

Now at (to, 00, s) = (fo, 00, 0) have 

d(io,0o,O) 0- 



with 



dto 



(to, 00.0) 



1 DM 

\\DH{qo{—io))\\ dto 



^0. 



(to, 0 o) 



(28.1.65) 

(28.1.66) 



(28.1.67) 



(28.1.68) 



By the implicit function theorem, there thus exists a function 



to — to{4>oa) (28.1.69) 

for 10 — 00 1, e sufficiently small, such that 

J(to(0o, e), 00, s) = 0. (28.1.70) 

This shows that M^'*(7e(t)) and M^“(7e(t)) intersect 0{s) close to 
{qo{—to),4>o)- Next we need to worry about transversality. 



Suppose that M^'*(7£(t)) and W^{'^s{t)) intersect at some point p. Then 
recall from Chapter 12 that the intersection is said to be transversal if 

TplT*(7,(t)) + TplT“(7e(t)) = (28.1.71) 



Now, for £ sufficiently small, the points on W^{'fs{t)) and TT“(7e(t)) that 
are closest to 7e(t) in the sense of Definition 28.1.2 can be parametrized by 
to and 00. Hence, 



/ dq'^ \ 
V dto ’ d(j)o ) 



(28.1.72) 



( dql dql \ 

V dto ’ d(j)o j 



(28.1.73) 



are a basis for TpW'^{'^g{t)) and TpW‘^{'^s{t)), respectively. 



(Note: it is important for the reader to understand how (28.1.72) and 
(28.1.73) are computed. By definition, p = (g|,0o) = (g“,0o), and ql and 
q^ are the points satisfying g|(0) = g“(0) = g", where ql{t) and g“(t) 
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are trajectories in and lF“( 7 £(t)), respectively. Since those tra- 

jectories depend parametrically on to and (28.1.72) and (28.1.73) are 
simply the derivatives of the respective trajectories with respect to to and 
(j)Q evaluated at t = 0.) 



TpW^i^^eit)) and TpFF“( 7 e(t)) will not be tangent at p provided 



or 



dqe 9ql 
dto dto ^ 

9qe / Q 

500 500 ^ 



(28.1.74) 



(28.1.75) 



Differentiating d(to,0o,£) with respect to to and 0o and evaluating at the 
intersection point given by (to -I- (5(e), 0o) (where M(to, 0o) = 0) gives 



dd — 
^(io,0o,£) 



DH{qo{-to)) ■ ((5gr“)/(5to) - (5(?|)/(5to)) 



||DiJ(go(-to))|| 

_ (5M/5to)(to,0o) 2^ 

\\DH{qo{-to))\\ ^ 



(28.1.76) 



dd 

500 



{to, 00 ) 



= £ 



DH{qo{-to)) ■ ((5g")/(50o) - (5gg)/(50o)) 

\\DH{qo{-hm 
(5M/50o)(to,0o) 



\\DH{qo{-io))\\ 



+ 0{e^). 



(28.1.77) 



Hence, it should be clear from (28.1.76) and (28.1.77) that, for e sufficiently 
small, a sufficient condition for transversality is 

5M - 1 5M 

7j-(^o,<^o) = -7^(io,0o)t^O. (28.1.78) 

500 w 5to 

Finally, we leave the fact that M(to,0o) yf 0 implies FF®( 7 e(t))n 
W^“(7e(t)) = 0 as an exercise for the reader (see Exercise 7). □ 



28.2 Poincare Maps and the Geometry of the 
Melnikov Function 



We now want to describe the geometry associated with the independent 
variables, to and 0o, of the Melnikov function. 

Consider the following cross-section to the phase space x 
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= {(g,^) GR2 ^ S'i|0 = ^o}. (28.2.1) 

Since (j) = uj > 0, it follows that the vector field is transverse to . Then 
the Poincare map of into itself defined by the fiow generated by the 
perturbed vector field (28.1.6) is given by 



Ps : 

<?e(0) 



S'^°, 

9e(27r/w), 



(28.2.2) 



where {qe{t) , (j){t) = ujt+ 4>o) denotes the fiow generated by the perturbed 
vector field (28.1.6). Now the periodic orbit 7 e(t) intersects S'^° in a point 
which we denote as 

P..0o=7e(f)ns^°. (28.2.3) 

It should be clear that Ps, 4 >o is a hyperbolic fixed point for the Poincare map 
having a one-dimensional stable manifold, and a one-dimensional 

unstable manifold, IP“(p£, 0 o) given by 

ip^(P£,^j = ip^(7,(t))ns^« 



and 

VP“(P£,^J = VP“( 7 e(i)) n (28.2.4) 

respectively; see Figure 28.2.1. 

Now let us return to the Melnikov function. From our parametrization 
of the homoclinic manifold, F..^,, it follows that fixing 4 >q and varying to 
corresponds to restricting our distance measurement to a fixed cross-section 
In this case, M{to,<j>o), (f>Q fixed, is a measurement of the distance 
between VF'*(pe, 0 u) and IF“(pe, 0 (,). In this case also zeros of the Melnikov 
function correspond to homoclinic points of a two-dimensional map, and 
Moser’s theorem or the Smale-Birkhoff homoclinic theorem may be applied 
to conclude that the dynamics are chaotic (provided the homoclinic points 
are transverse). 

Alternately, we can fix tg and vary in th® Melnikov function. This 
would correspond to fixing 7 Tp at a specific point {qo{—to),4>o) on F.y and 
varying the cross-section The Melnikov function would be a measure 
of the distance between and kF“( 7 e(t)) at a fixed location in 

q but on different cross-sections Since the vector field has no fixed 
points on UbF“( 7 e(t)), as the cross-section is varied all orbits in 

IF'*('y^(t)) UkF“( 7 e(t)) must pass through with < 7 o(— ^o) fixed. Hence, no 
homoclinic orbits would be “missed” by the Melnikov function. 

However, recall the form of the Melnikov function given in (28.1.61) 

/ OO 

DH{qo{t)) ■ g{qo{t),u!t + ujto + (t)o,0)dt. (28.2.5) 



— OO 
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(p=2n 



=0oj 

identify 



(p=0 




FIGURE 28.2.1. 

It is clear from (28.2.5) that, analytically, the variation of to with (f>Q fixed 
is equivalent to the variation of </>o with to fixed. The underlying reason for 
this is that if VU^( 7 e(t)) and VU“( 7 £(t)) intersect, by uniqueness of solutions 
they cannot intersect at isolated points but, rather, must intersect along a 
trajectory (solution of (28.1.6)) that is asymptotic to 7 e(t) in both positive 
and negative time. 



28.3 Some Properties of the Melnikov Function 



Here we collect some basic properties and characteristics of the Melnikov 
function. 



1. As mentioned earlier, M(to,3o) is a signed measure of the distance. 
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We now want to explore this in more detail. Let us restrict ourselves 
to the Poincare map defined on the cross-section and view (/>o 
as fixed; then d{to,(j)o,s) measures the distance between 
and VP“(Pe, 0 o) (see Section 28.2). We recall that the distance between 

at the point p = iqo{-to), M is given by 



d{to,(po,£) 



DH{qo{-to)) ■ (C - ql) 



\\DH{qo{-to))\\ 

M{to,(l)o) 

^\\DH{qo{-tom 



0{e^) 



(28.3.1) 



where and are defined in Definition 28.1.2. Hence, 
ciently small 



-^(^Oj'/'o)^ q =k d(to, '(' ojS) ^ Q 



for e suffi- 
(28.3.2) 



Thus, using (28.3.1) and (28.3.2), Figure 28.1.10 holds if we replace 
d{to,(j)o,£) with M{to,(po)- 



2. M{to, (j>o) is periodic in to with period 2tt/uj and periodic in 0o with 
period 27 t. This follows from the fact that the perturbation, eg{q, t, e), 
is periodic in t with period 2tt/uj as well as from the form of M{to, (f>o) 
given in (28.1.61). 



3. Recall from (28.3.1) that the distance between bF®( 7 e(t)) and 
W“( 7 e(t)) is given by 



d{to, 4>o,s) 



M{to, 4>o) 

^\\DH{qo{-tom 



0{e^). 



(28.3.3) 



We want to focus on the denominator, \\DH{qo{—to))\\, in the 0{e) 
term of (28.3.3). Let us consider the situation with (j)o fixed so that 
d{to,(j)o,£) measures the distance between the stable and unstable 
manifolds of a hyperbolic fixed point of the Poincare map (see Sec- 
tion 28.2). Then, as to — >■ ±oo, the measurement of distance is being 
made close to the hyperbolic fixed point (since qo{—to) approaches the 
unperturbed hyperbolic fixed point as to ^ ±oo). Also, as to — >■ ±oo, 
\\DH{qo{—to))\\ — >■ 0, indicating that d{to, 4>0i s) oo. Geometrically, 
this means that the distance between the manifolds is oscillating un- 
boundedly near the hyperbolic fixed point. The reader should com- 
pare this analytic result with the geometrical picture given by the 
lambda lemma (Lemma 26.0.4). 



4. Suppose that the perturbation is autonomous, i.e., eg(q) does not 
depend explicitly on time. Then the Melnikov function is given by 



DH(qo(t)) ■ g(qo(t),0)dt. 



M = 



— OO 



(28.3.4) 
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In this case M is just a number, i.e., it is not a function of to and 
4>o- This makes sense, since, for autonomous two-dimensional vector 
fields, either the stable and unstable manifolds of a hyperbolic fixed 
point coincide or they do not intersect at all. In Exercise II we will 
deal more fully with the geometry of the Melnikov function for au- 
tonomous problems. 

5. Suppose the vector field is Hamiltonian, i.e., we have a (r > 2) 
function periodic in t with period T = 27 t/w given by 

He{x,y,t) = H{x,y) + eHi{x,y,t,e) (28.3.5) 

such that the perturbed vector field (28.1.1) is given by 

. OH , ^ dHi, 

x= -^[x,y) + £^—{x,y,t,e), 
dy dy 

dH , , dHi, 

y = - — {x,y) - £-^{x,y,t,s). (28.3.6) 



In this case, using (28.1.61) and (28.3.6), it is easy to see that the 
Melnikov function is given by 

/ OO 

{H, Hi}{qo{t),Lot + bjto + </>o, 0)dt, (28.3.7) 

-OO 

OH dHi dH dHi 



where 






dx dy dy dx 
is the Poisson bracket of H with Hi. 



(28.3.8) 



28.4 Homoclinic Bifurcations 

Suppose the vector field (28.1.6) depends on a scalar parameter y., i.e., 
q = JDH{q) + eg{q, (j>, y, e), 

4> = oj, (g, (/), /r) € X X (28.4.1) 

If, in a specific problem, there is more than one parameter, then consider 
all but one as fixed. In this case the Melnikov functions depend on the 
parameter y. In particular, we write 

M{to,<l>o,y)- (28.4.2) 

We will consider do fixed, i.e., the Poincare map associated with (28.4.1) 
is defined on the cross-section We have the following bifurcation the- 
orem for the Melnikov function. 




716 



28. Melnikov’s Method 



Theorem 28.4.1 Suppose we have {to,p) such that 



i) M{to,(j)o,fJ-) = 0, 

... dM 

ii) (io,0o,M) = 0; 
oto 

.... dM —N / „ 

111 ) -^{to,(po,N ^0, 

d'^M - 



Then the stable and unstable manifolds of the hyperbolic fixed point on the 
cross-section S‘^“ are quadratically tangent at {qo{—io)) + 0{e) for p, = 
p + 0{e). 



Proof: Tangency of the manifolds implies 

d{to,(j)o,p,£) = 0 , 

^ „ 

— (to,0o,/i,£) = 0. 
ato 

Let dfto, 4>o, p, e) = edlfo, (f>o, p, e) with 

A, , rm \ 

- \\DH{q,{-t,))\\ 

as in the proof of Theorem 28.1.7. Then a solution of 



(28.4.3) 



(28.4.4) 



d{to,(fo,p,£) = 0, 

dd 

^{to,(l)o,p,£) = 0, (28.4.5) 

dto 

is a solution of (28.4.3). Now by Hypotheses i) and ii) of the theorem, 
(28.4.5) has a solution at {to,4>o,Tt,0)- Hypotheses iii) and iv) allow us to 
apply the implicit function theorem to show that the solution persits for 
£ sufficiently small; the details follow exactly as in the proof of Theorem 
28.1.7 and are left as an exercise for the reader. □ 



We remark that the condition (to , 0o ) m) 7^ 0 implies that the tan- 



gency is quadratic. 
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qo(t) qjw 




FIGURE 28.4.1. 

28.5 Application to the Damped, Forced Duffing 
Oscillator 



The damped, forced Duffing oscillator is given by 



x = y, 

y = X — + e{"f cos (p — Sy ) , 

p = io. (28.5.1) 

For e = 0, (28.5.1) has a pair of homoclinic orbits (see Exercise 1.2.29) 
given by 

q^{t) = (xq (t), j/^(t)) = (±-\/2secht, =p-\/2secht tanhf); (28.5.2) 
see Figure 28.4.1. 



The homoclinic Melnikov function is given by 

/ OO 

\-Ky^{i)Y ± iy^{t) cos{ujt + ujto + (po)]dt. 

-OO 

Substituting (28.5.2) into (28.5.3) gives 

M^(to, (po) = i v^yTTWsech^ sin(o;to + Po)- 
Fixing pQ defines a cross-section 

= {{x,y,p) € K X M X S^\p = po}, 



(28.5.3) 



(28.5.4) 



(28.5.5) 



where the Melnikov function describes the splitting of the stable and un- 
stable manifolds of the hyperbolic fixed point defined on the cross-section. 
Let denote the Poincare map of the cross-section S‘^° defined by the 
flow generated by (28.5.1) and consider the case (5 = 0. Then, using the 
Melnikov function (28.5.4) and Remark 1 of Section 28.3, it is easy to ver- 
ify that the stable and unstable manifolds of the hyperbolic fixed point 
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intersect as in Figure 28.5.1 for (po = 0, tt/2, tt, and 37t/2. Figure 28.5.1 
illustrates an important point; namely, that altering the cross-section on 
which the Poincare map is defined can change the symmetry properties of 
the Poincare map. This can often result in substantial savings in computer 
time in the numerical computation of Poincare maps. We will explore these 
issues in Exercise 10 as well as consider how Figure 28.5.1 changes for <5 yf 0. 



From (28.5.4) it is easy to see that the condition for the manifolds to 
intersect in terms of the parameters (5, w, 7 ) is given by 



<5< 



/ Sttw sech^ \ 

V 2V2 ) 



7- 



(28.5.6) 




X 



FIGURE 28.5.1. 
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In Figure 28.5.2 we graph the critical surface S = ^ sec^(-n-t^/2) 
we note the following. 



8 




FIGURE 28.5.2. a) Graph of the critical surface S = ^ sec^(Tr^/2) J 

Cross-section of the critical surface for 7 = constant, c) Cross-section of the 
critical surface for ui = constant, d) Cross-section of the critical surface for 5 = 
constant. 



1. The condition (28.5.6) for intersection of the manifolds is independent 
of the particular cross-section (as it should be). 

2. If the “right-hand” branches of the stable and unstable manifolds 
intersect, then the “left-hand” branches also intersect, and vice-versa. 
However, as seen in Figure 28.5.1, the geometry of intersections of the 
right-hand branches and left-hand branches may differ. 



Thus, (28.5.6) is a criterion for chaos in the damped, forced Duffing 
oscillator as a function of the parameters ((5, o;,7). 

Finally we remark that Melnikov-type methods have been developed for 
multi-degree-of-freedom systems and for systems having more general time 
dependencies. These techniques also deal with orbits homoclinic to invari- 
ant sets other than periodic orbits. A complete exposition of this theory 
can be found in Wiggins [1988]. 
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28.6 Exercises 



1. Consider the parametrization of the homoclinic manifold given in (28.1.9). Show that 
the map 

{to, <f>o) I-*- (go(— *o), <i>o), (to, 0o) G X 
is C^, one-to-one, and onto. 

2. Recall Proposition 28.1.1. Prove that, for e sufficiently small, 7e(0 persists as a peri- 
odic orbit of period T = 27zj<jj having the same stability type as 7 (t). For information 
concerning the persistence of the local stable and unstable manifolds, see Fenichel 
[1971] or Wiggins [1994], 

3. Suppose that F-y intersects 7Tp transversely at some p — (go(~^o); ^^o)- Show that, 

for e sufficiently small, lF^(7e(0) each intersect Hp transversely at a 

distance 0{s) from p. 

4. Recall Definition 28.1.2. Show that the points p^ j and j “closest” to 7e(i) in the 
sense of Definition 28.1.2 are unique. {Hint: study the proof of Lemma 28.1.3.) 

5. Recall the set-up for the proof of Lemma 28.1.3. Choose 

(9o.'>o) e n V(eo) 

and 

e WiQj,(7e(t)) n V(eo), 

with the trajectories 

(9o(t).0(t)) e w‘{-r{t)) 

and 

(<?:(*), 0(t)) G W“(7e(t)) 

satisfying 

(9o(o)>«^(o)) = iqo’M 

and 

Prove that 
for 0 < f < oo. 

6. Suppose q^{t) £ W^( 7 g,(t)) is a 



Hi)) - (9o(0.<^(t)) = O{eo) 



solution of (28.1.6). Then show that 
59|(t)| 



ds 



= Qiii) 



is bounded in f as t ^ oo. {Hint: as t ^ oo, Q'i(t) should behave as 
Does the same result hold for solutions in W^( 7 e(t)) as t — oo? 



7. Recall Theorem 28.1.7. Show that if M(fo»0o) ^ 0 for all (toi^o) G x , then 
W^( 7 e(t)) n W’^( 7 e(f)) — 0. {Hint: study the proof of Lemma 28.1.3.) 



8. Consider the Poincare maps associated with the damped, periodically forced Duffing 
equation on the cross-sections and shown in Figure 28.5.1 for 

(5 = 0. Describe in detail how the geometry of the stable and unstable manifolds 
changes on each cross-section for 5^0. {Hint: use the Melnikov function.) 
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9. Melnikovas Method for Autonomous Perturbations 
Suppose we consider the (r > 2) vector field 

^ egi{x,y; y.,6), 

oy 

dH 3 

y = + eg 2 {x,y; ti,e), {x,y,fi)£R , 

OX 



or 



q = JDH(q) + eg{q-,fi, e), 



(28.6.1) 



where 



g = (x, y), 



DH = 



J = 



0 1 

-1 0 



9 = ( 51 , 52 ), 



with e small and fi regarded as a parameter. 

Suppose that the unperturbed system (i.e., (28.6.1) with e = 0) satisfies Assumptions 
1 and 2 of Section 28.1. Discuss the geometrical meaning of 

M(p) = J {DH ■ g)(qo{t), fj.)dt. 

Compare M{fi) with M(fo,‘?^’o) derived in Section 28. 




FIGURE 28.6.1. 



10 . The equation of motion describing the librational motion of an arbitrarily shaped 
satellite in a planar, elliptical orbit is 

(1 + eg. cos 9)'ip" — 2eg sin + 1) + 3Ki sin -if cos -if — 0 
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where Kt — ^ i® small, and efi is the eccentricity of the orbit; 

see Modi and Brereton [1969]. The geometry is illustrated in Figure 28.6.1. For s 
small, this equation can be written in the form (using the fact that i_|_g^^cos 9 ~ 
1 — sf-t cos 9 O (e^ ) ) 

'll)" + ZKi sin cos 'ip — s\2^ sin 9{'ip' + 1) 

+ 3fiKi sin 'tp cos ip cos 9] O (e^ ) . 



Use Melnikov’s method to study orbits homoclinic to hyperbolic periodic orbits for 
£ 7 ^ 0. Describe the physical manifestation of any chaotic dynamics that arise in this 
problem. 

11. The driven Morse oscillator is an equation frequently used in theoretical chemistry 
to describe the photodissociation of molecules (see, e.g., Goggin and Milonni [1988]). 
The equation is given by 



X ^ y, 

y — — + e'y cos cot, 



(28.6.2) 



with //, 7 , a; > 0 . 

For £ = 0, the equation is Hamiltonian with Hamiltonian function 
H(x,y) = ^ + fj. (^-e“* + ■ 

a) Show that for £ = 0, (x, y) — (oo, 0) is a nonhyperbolic fixed point of (28.6.2) 
that is connected to itself by a homoclinic orbit. 

We would like to apply Melnikov’s theory to (28.6.2) in order to see if (28.6.2) has 
horseshoes; however, the fixed point having the homoclinic orbit is nonhyperbolic. 
Therefore, the theory developed in this chapter does not immediately apply. Schecter 
[1987a], [1987b] has extended Melnikov’s method so that it applies to nonhyperbolic 
fixed points. However, we will not develop his technique. 

Instead, we introduce the following transformation of variables 

X — — 21ogii, y — V, (28.6.3) 



and reparametrize time as follows 



ds u 

~dt ~ ~2' 

b) Rewrite (28.6.2) in these new variables and show that the resulting equation, 
for £ = 0 , has a hyperbolic fixed point at the origin that is connected to itself 
by a homoclinic orbit. Apply Melnikov’s method in order to study homoclinic 
orbits for £ 7 ^ 0 . 

c) Describe the resulting chaotic dynamics in both the x — y and u — v coordinate 
systems. 

This problem was originally solved by Bruhn [1989]. The transformation (28.6.3) is 
known as a “McGehee transformation” in honor of Richard McGehee, who first cooked 
it up in order to study a degenerate fixed point at infinity in a celestial mechanics prob- 
lem (see McGehee [1974]). Such singularities frequently arise in mechanics and coordi- 
nate transformations such as (28.6.3) can greatly facilitate the analysis. An excellent 
introduction to such problems can be found in Devaney [1982]. 

12. Consider the C’' (r > 2) vector field 



* = + sgi(x, y, t; e), 

V = f 2 (x,y) + eg 2 (x, y, t; e), 



(x, y) e R^, 
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or 

4 = fig) + sg(q,t;e), (28.6.4) 

where 

g = {x, v), 

f = (/ 1 ./ 2 ). 

g = (91,92), 

with s small and g{q,t;e) periodic in t with period T = 2t^ ju). 

Assumption: For e = 0, (28.6.4) has a hyperbolic fixed point at po that is connected 
to itself by a homoclinic orbit, i-®-) lirnt->-±oo 9 o (0 — Po- 

a) Derive a measure of the distance between the stable and unstable manifolds of 
the hyperbolic periodic orbit that persists in (28.6.4) for e sufficiently small. 
{Hint: follow the steps in this chapter as closely as possible. If you need help, 
see Melnikov [1963].) 

b) Using the parametrization of the unperturbed homoclinic orbit in terms of 

(toi^o) ^ defined in (28.1.9), is the Melnikov function obtained 

in part a) periodic in both to ^nd ^o? Explain fully the reasons behind your 
answer. 

13. How is the Melnikov theory modified if in the unperturbed vector field we instead had 

two hyperbolic fixed points, pi and p 2 , connected by a heteroclinic orbit, qo{t), i.e., 
limt_j.oo 9o(0 = Pi linit_»._oo <?o(^) = P 2 ? {Hint: follow the development of the 

homoclinic Melnikov theory in this chapter. You should arrive at the same formula 
for the distance between the stable and unstable manifolds; however, the geometrical 
interpretation will be different.) 

14. Consider the vector field 

9 — ev, 

V — —s sin 0 cos u>t, {9 , v) ^ X , s small. 



Apply Melnikov’s method to show that the Poincare map associated with this equation 
has transverse homoclinic orbits. What problems arise? Can any conclusions be drawn 
for homoclinic orbits arising in the following vector field? 

9 ^ ei;, (28.6.5) 

V — — £sin 0 + e‘^{ — Sv + 7 coso;t). (28.6.6) 



What implications do these examples have for applying Melnikov’s method to vector 
fields 

X — e/(x, f), / — T-periodic in t, rc G R , 

that are transformed into 

V ^7{y) + sgivH), yeR^, J{y) = ^ f f{y,t)dt, 

J Jo 

by the method of averaging? 

We refer the reader to Holmes, Marsden, and Scheurle [1988] and Delshams and Seara 
[1992] for more examples of problems of this type along with some rigorous results. 

15. Consider the following vector field : 

X ^ y, 

y = X — x^ — s6y + £7 cos cot. 

Compute the Melnikov function and describe the surface in 7 — <5 — cj space where the 
bifurcation to homoclinic orbits occurs. 
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16. Fluid Transport and the Dynamical Systems Point of View. Consider the Navier— 
Stokes equations for three-dimensional viscous, incompressible fluid 

dv ^ ^ 1 2 

— (r; - V)v = -Vp V — 

at R 

where p denotes the pressure and R the Reynolds number. Additionally, boundary 
and initial conditions may be specifled. The solution of this highly nonlinear partial 
differential equation gives a velocity field, v{x,t). Suppose we are interested in the 
transport of infinitesimal fluid elements (referred to as fluid particles) in this flow. The 
fluid particles move under the influence of two processes: convection (or advection) due 
to the velocity field and molecular diffusion (since the fluid is not really a continuum). 
The motion of fluid particles due to convection is determined by 

X — v{x, t), X G 



This is simply a finite-dimensional dynamical system where the phase space is actually 
the physical space occupied by the fluid. 

If we consider two-dimensional incompressible inviscid fluid flow, the velocity field can 
be determined from a stream function, 'tp(xi, X 2 ',t), where 



v(xi, X2, t) 



/ d'ip d'ip \ 

V dx2 ’ dxi ) 



see Chorin and Marsden [1979] for background on these statements. The equations for 
fluid particle motions in this case become 



d'lp 

±1 = (xi,X2,t), 

0X2 

dip 

±2 = (xi,X 2 ,t). 

OXi 



The reader should note that this is simply a Hamiltonian dynamical system where the 
stream function plays the role of the Hamiltonian. The study of the transport and 
mixing of fluids along these lines using the framework of dynamical systems theory is 
a topic of much current interest; the reader should consult Ottino [1989] for a good 
introduction and Rom-Kedar, Leonard, and Wiggins [1990] for a specific example. 

We will consider the situation of fluid particle transport in modulated traveling waves 
in a binary fluid mixture heated from below; see Weiss and Knobloch [1989] and Moses 
and Steinberg [1988]. The stream function (in a moving frame) near an instability 
leading to time-dependent oscillations is given by 







FIGURE 28.6.2. 
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1p{xi,X2,t) = 1 po{xi,X 2 ) + eipl{x-i_, X2,t), 



where 



'iPq{xi, X 2 ) — —X 2 -\- R COS xi sin X 2 , 



i/’i (xi , X2,t) — — I I 1 — — J cos(a:i -\- cot -\- 9 ) 



+ 1 + 



cos(£Ci — u)t — 9 ) \ sin X2 



In the above cj > 0, 0 is a phase, and K, 7 , and e are parameters (amplitude depending 
on the temperature) with 0 < e << 1; see Weiss and Knobloch [1989] for a detailed 
discussion. The equations for fluid particle motions are given by 



Xx = {Xx.X2) ^ {Xx,X2.t), 

0X2 0X2 

O'tpo d'lpi 

X2 = - t ; {XlyX2) - e- {xi,X2,t). 

uXi Ox I 



For £ = 0, i? > 1, the streamlines of the flow (corresponding to the level sets of 
■ 00 (^ 153 : 2 )) appear as in Figure 28.6.2. Note the two hyperbolic fixed points on the 
a^i-axis, denoted and p_, respectively. The fixed points are connected by a pair 
of heteroclinic orbits denoted Fq and F^. This heteroclinic cycle forms a region of 
trapped fluid that is shaded in Figure 28.6.2. 

a) For £ 7 ^ 0 show that Fq persists and results in a barrier which fluid cannot cross. 

b) Show that for £ 7 ^ 0, F^ breaks up giving rise to transverse heteroclinic orbits. 
This provides a mechanism for fluid to mix between the two regions in the time- 
dependent fluid flow. 

c) Show that a horseshoe exists in the heteroclinic tangle for £ 7 ^ 0. {Hint: see 
Rom-Kedar et al. [1990].) Hence, chaotic fluid particle trajectories exist. 
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Liapunov Exponents 



A positive Liapunov exponent is often taken as a signature of chaos. We 
will address this specific question in Chapter 30. But in this chapter we 
discuss Liapunov exponents following Oseledec [1968] and Benettin et al. 
[1980a, bj. 



29.1 Liapunov Exponents of a Trajectory 

Consider the C’’ (r > 1) vector field 

X = f{x), X G M". (29.1.1) 



Let x{t,xo) be a trajectory of (29.1.1) satisfying x(0,a:o) = xq- We want to 
describe the orbit structure of (29.1.1) near x(t,xo)- In particular, we want 
to know the geometry associated with the attraction and/or repulsion of 
orbits of (29.1.1) relative to x{t,xo)- For this it is natural to first consider 
the orbit structure of the linearization of (29.1.1) about x{t,Xo) given by 

i = Df{x{m, (29.1.2) 



Let X{t;x{t,xo)) be the fundamental solution matrix of (29.1.2) and let 
e yf 0 be a vector in R”. Then the coefficient of expansion in the direction 
e along the trajectory through xq is defined to be 



^ _ II^T2;(t,a;o))e|| 

ej — .... 



(29.1.3) 



where |[ • |[ = a/ (•, •) with (•,•) denoting the standard scalar product on 
M”. Note that Xt{xo,e) is a time-dependent quantity that also depends 
on a particular orbit of (29.1.1) (through the fundamental solution matrix 
X{t; xft, xo))), a particular point along this orbit, and a particular direction 
along this orbit. The Liapunov characteristic exponent (or just Liapunov 
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exponent) in the direction e along the trajectory through xq is defined to 
be ^ 

x{X{t]x{t,xo)),xo,e) = lim -logAt(xo,e). (29.1.4) 

t—yoo t 



We make several remarks concerning this definition. 



1. Equation (29.1.4) is an asymptotic quantity. Therefore, in order for 
it to make sense, we must at least know that x{t,xo) exists for all 
t > 0. This will be true if the phase space is a compact, boundaryless 
manifold or if xq lies in a positively invariant region. 

2. For the zero vector we define x{X{t; x{t, xq)),xo, 0) = — oo. 

3. The fundamental solution matrix, X{t] x{t,xo)), of (29.1.2) is asso- 
ciated with a particular trajectory, x(t,xo), of (29.1.1). Thus, if we 
considered a different trajectory, i(t,a;o), of (29.1.1), then the fun- 
damental solution matrix associated with the vector field linearized 
about x{t, Xq) may (and most probably will) have different properties. 

4. The Liapunov exponent in the direction e along the orbit through 
Xq is unchanged if the initial condition of the trajectory along the 
orbit is varied. In particular, for any ti G R, let x{ti,Xo) = Xi; 
then x{X{t;x(t,xo)),xo,e) = x{X{t;x(t,xo)),xi,e). This should be 
intuitively clear from the fact that the exponents are limits as t — >■ oo; 
see Exercise 1. 

5. In light of the previous remark it makes sense to drop the initial con- 
dition of the trajectory from the notation for a Liapunov exponent, 
i.e., x{X{t;x{t,xo)),xo,e) = x{X{t;x{t,xo)),e). Moreover, our dis- 
cussions in this subsection will be concerned with Liapunov exponents 
associated with a given trajectory, and, hence, a given fundamental 
solution matrix X(t; x{t, xq)). In this setting we will further simplify 
the notation by dropping the explicit dependence on the fundamental 
solution matrix X{t; x{t, a;o))> i-e., x{X{t; x{t, xq)), e) = x(e). 

6. In general, Liapunov exponents are not continuous functions of the 
orbits. This can be seen from Example 29.2.3. 



Next we want to develop some geometrical properties of the Liapunov 
exponents associated with a given trajectory. The following lemma is fun- 
damental in this regard. 



Lemma 29.1.1 For any vectors f,gG K", and nonzero constant c G K, 
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x(/ + 5) <max{x(/), x(g)}, (29.1.5) 



x{cg) = x{g)- 



(29.1.6) 



Proof: This follows immediately from the definition of Liapunov exponents 
given in (29.1.4). □ 

This lemma, along with the standard definition of a vector space, implies 
the following proposition. 



Proposition 29.1.2 For any r G K 

{ff e K” I x{g) < r} 



is a vector subspace o/M". 



From this proposition we can conclude that there are at most n (i.e., the 
dimension of the phase space) distinct Liapunov exponents associated with 
a trajectory. More precisely, we have the following result. 



Proposition 29.1.3 The set of numbers 

{X(5)}<,e«'“ , 

g^o 

takes at most n = dimlST values, which we denote by 
vi > ■ ■ ■ > Vs, 1 < s < n. 



Proof: See Exercise 2. □ 

Associated with the s distinct values for the Liapunov exponents we have 
s + 1 nested subspaces. 

Proposition 29.1.4 Let Li = {g G K" | x(ff) ^ ^i}- Then 
{0} = Ls+i C Ls C • • • C Li = M" 

with Li+i yf Li and x{g) = t'i if o.'nd only if g G Li\Li^i, 1 < i < s. 
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Proof: This is an immediate consequence of Proposition 29.1.2 and Propo- 
sition 29.1.3. □ 

Next we define the notion of the spectrum of Liapunov exponents asso- 
ciated with X{t]x{t,X(f)). 

Definition 29.1.5 (Spectrnm of X{t] x{t, a^o))) The numbers i^i, • • • , 
are referred to as the spectrum of Liapunov exponents associated with 
X{t;x{t,xo)), and denoted by sp {X(f;x(t,xo))). The multiplicity of Vi is 
denoted by ki, and is given by ki = dimLi — dimLi^i, 1 < i < s. 

Next we want to consider conditions under which the limit, rather than 
the lim sup, in (29.1.4) exists. First we need two definitions. 

Definition 29.1.6 (Normal Basis) A basis {ei, • • • , e„} o/M" is said to 
be a normal basis if 

71 n 

X^x(ei) < XI 

i=l i=l 

where {/i, • • • , /„} is any other basis of K". 

Definition 29.1.7 (Regular Family) The fundamental solution matrix 
X{t;x{t,Xo)) is called regular as t — >■ oo if 

1. limj_>oo i log I det X(t; a;(t, Xo))| exists and is finite, and 

2. for each normal basis {ci, • • • , e„} 

” 1 

Xx(ei)= lim -log|detX(t;a:(t,xo))|. 

^ t—^OO t 



Now we can state the main existence theorem due to Liapunov [1966]. 
Theorem 29.1.8 If X{t]x{t,xo)) is regular ast^oo, then 

X(e) = lim LogAt(xo,e) (29.1.7) 

t—^(X> t 

exists and is finite for any vector e € M" . 
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Proof: See Liapunov [1966], Oseledec [1968], or Benettin et al. [1980a]. □ 

This immediately raises the question of whether or not a particular 
fundamental solution matrix is regular. This problem was addressed by 
Oseledec [1968] and is answered (in a sense) in the Oseledec multiplica- 
tive ergodic theorem. Oseledec showed that, with respect to some invariant 
measure, almost all trajectories give rise to regular fundamental solution 
matrices. There are a number of technical issues associated with stating 
this fundamental theorem, and we refer the reader to Oseledec [1968] and 
Benettin et al. [1980a] for more details. 

Liapunov exponents are just one type of spectra one can associate with 
a linear, time- varying system. There are others (e.g., we introduced the 
notion of exponential dichotomy in Chapter 3) . A survey of different spectra 
for linear systems is given in Dieci and van Vleck [2002]. 

Some new insights into the behavior of Liapunov exponents, and their 
generalizations, have been obtained by Colonius and Kliemann by combin- 
ing methods of nonlinear geometric control theory with dynamical systems 
theory. See Colonius and Kliemann [1996b], and references therein. 
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Let us now consider a few examples. 

Example 29.2.1 (A Linear, Homogeneous, Constant Coefficient System). Con- 
sider the linear, scalar vector field 

X = ax, X G (29.2.1) 

where a is a constant. Equation (29.2.1) has three orbits, a; = 0, a; > 0, and 
a: < 0, but the fundamental solution matrix associated with each orbit is given 
by 

X{t) = e“*. (29.2.2) 

Thus, using (29.2.2) and (29.1.7), we see that each orbit of (29.2.1) has only one 
Liapunov exponent, and the Liapunov exponent of each orbit is a. Thus, if a > 0, 
trajectories of (29.2.1) separate exponentially as t — >■ oo. 



End of Example 29.2.1 



Example 29.2.2 (An Integrable System). Consider a planar, Hamiltonian sys- 
tem in a region of phase space where the vector field is given in action-angle 
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variables as follows 



7 = 0, 

e = Q{i), 



(7,61) € R+ X S\ 



Then, a trajectory of 



(29.2.3) is given by 

7 = constant, 

6{t) = fl(7)6 + 6o- 



Linearizing (29.2.3) about (29.2.4) gives 



Cl 

6 



0 

fr(l) 




(29.2.3) 



(29.2.4) 



(29.2.5) 



The fundamental solution matrix of (29.2.5) is easily computed and found to be 

^^^^^(c^(7)6 o)’ (29.2.6) 

where C is a constant. Letting 59 represent a vector tangent to a trajectory and 
51 a vector normal to a trajectory using (29.2.6) and (29.1.7), we easily obtain 

x{I,SI)=0, 

Xil,59)=0, 



for any 7 labeling a trajectory of (29.2.3) defined by (29.2.4). 



End of Example 29.2.2 

Example 29.2.3 (A Two-Dimensional, Autonomous, Nonlinear System). Con- 
sider the vector field 



3 

X = X — X , 

y = -y- (29.2.7) 

In Example 8.2.2 of Chapter 8, we saw that (29.2.7) has a saddle at {x, y) = (0, 0) 
and sinks at (±1,0). Moreover, the closed interval [—1,1] on the a;-axis is an 
attracting set; see Figure 29.2.1. 

We want to compute the Liapunov exponents associated with orbits in this 
attracting set. The attracting set [—1, 1] contains five orbits, the fixed points 
(x,y) = (0,0), (±1,0) and the open intervals (—1,0) and (0,1). Each orbit has 
two Liapunov exponents. We will compute the Liapunov exponents of each orbit 
individually. We let 5x = (1,0) denote a tangent vector in the x direction and 
5y = (0, 1) denote a tangent vector in the y direction. It should be clear that 
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FIGURE 29.2.1. 

at each point of the attracting set Sx and Sy are a basis of The Liapunov 
exponents for the three fixed points are trivial to obtain and we merely state the 
results. 



(0,0) 


X((0,0),(5a;) = +1, 




X((0,0),5y) = -1. 


(-1,0) 


X((-I,0),5a;) = -2, 




X((-I,0),<5y) = -1. 


(+1,0) 


x((l,0),5a:) = -2, 




X((l,0),<5?/) = -1. 



The Liapunov exponents for the orbits 0 < a; < 1, y = 0, and —1 < a; < 0, 
y = 0, require a little more work. Note that (29.2.7) is unchanged under the 
coordinate transformation x —>■ —x. Therefore, the Liapunov exponents for the 
orbit Q < X < 1, y = 0, are the same as for the orbit —1 < x < Q, y = Q. 



Linearizing (29.2.7) about (x(t),0) gives 



l-3a;^(t) 0 



0 



-1 



(29.2.8) 



where x(t) is a trajectory in 0 < a; < 
(29.2.7) gives 

x^(t) : 



1, y = 0. Integrating the a;-component of 



+ 1 



(29.2.9) 



Substituting (29.2.9) into (29.2.8), we obtain the fundamental solution matrix 
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X{t) = 






-2t\-i/2 




(29.2.10) 



Using (29.1.7) and (29.2.10), we obtain the Liapunov exponents 

Q<x<\, y = Q x((0<a;<l,J/ = 0), 5x) = —2, 

X((0 < a: < 1, y = 0), 5y) = -1, 

— 1 < a: < 0, y = Q x((~l < ® < 0> J/ = 0), 5a;) = — 2, 

X((-1 <x <Q,y = 0),5y) = -1. 



End of Example 29.2.3 



We end this section with some final remarks. 

Remark 1. We can view the Liapunov exponents of a given orbit as the 
long time average of the real parts of the eigenvalues of the fundamental 
solution matrix associated with the linearization of the vector field about 
the orbit. Therefore, they give us information concerning local expansion 
and contraction of phase space only and nothing about twisting and folding. 

Remark 2. It should be clear that if the orbit is a fixed point or periodic 
orbit, then the Liapunov exponents are, in the first case, the real parts 
of the eigenvalues associated with the matrix of the vector field linearized 
about the fixed point and, in the second case, the real parts of the Floquet 
exponents. Thus, in some sense the theory of Liapunov exponents is a 
generalization of linear stability theory for arbitrary trajectories (but see 
Goldhirsch et al. [1987]). 

This brings up an interesting point. Associated with the linear 
eigenspaces associated with vector fields linearized about fixed points and 
periodic orbits are manifolds invariant under the full nonlinear dynamics 
where orbits have the same asymptotic behavior as in the linearized system. 
We refer to these as the stable and unstable manifolds. Might an arbitrary 
orbit possess stable and unstable manifolds having dimension equal to the 
number of negative and positive Liapunov exponents, respectively, associ- 
ated with the orbit? The answer to this question is yes, and it has been 
proved by Pesin [1976], [1977], but see also Sacker and Sell [1974], [1976a], 
[1976b], [1978], and [1980]. 

We have only hinted in this section at the various properties of Lia- 
punov exponents. For more information the reader should consult Liapunov 
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[1966], Bylov et al. [1966], Oseledec [1968], Ledrappier and Young [1991], 
and Young [1982]. We develop many additional properties of Liapunov ex- 
ponents in the exercises. 



29.3 Numerical Computation of Liapunov 
Exponents 



In applications the Liapunov exponents of a trajectory will need to be 
computed numerically. There has been much rigorous work in recent years 
involving the accurate computation of Liapunov exponents. Two papers 
that discuss this problem with great detail and rigour are Dieci et al. [1997] 
and Bridges and Reich [2001]. The paper of Dieci et al. [1997] is notable 
for providing error estimates for the finite time computations^. Dieci [2002] 
describes algorithms for computing Liapunov exponents that do not require 
the computation of a Jacobian. Related work can be found in Rangarajan 
et al. [1998] and Janaki et al. [1999]. These numerical approaches involve 
either a QR or a singular value decomposition (SVD) of the fundamental 
solution matrix. While the theory of QR and SVD for constant matrices 
is well-known, the fundamental solution matrix varies in time. The theory 
for QR and SVD of time varying matrices is developed in Dieci and Eirola 
[1999] amd Dieci and van Vleck [1999]. Methods for computing Liapunov 
exponents that are specific to Hamiltonian systems can be found in Partovi 
[1999] and Yamaguchi and Iwai [2001] . The paper of Ramasubramanian and 
Sriram [2000] provides a comparative study between different algorithms 
for the computation of Liapunov exponents. The work of Udwadia and von 
Bremen [2001], [2002] is also notable. 



29.4 Exercises 



1. Recall the discussion earlier in this chapter. Show that 

x(xo,e) =x(x(T),e) 



for any finite T. 

2. Prove Proposition 29.1.3. 

3. Consider a planar, Hamiltonian vector field. Must all Liapunov exponents of every 
orbit be zero? 



^Liapunov exponents are asymptotic quantities. They are defined in the limit 
as time approaches infinity. Obviously, this limit cannot be realized on a com- 
puter, and therefore some error is incurred. 
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4. Show that any trajectory of a vector field that remains bounded and does not terminate 
on a fixed point must have at least one zero Liapunov exponent. {Hint: consider the 
direction tangent to the orbit.) 

5. In any realistic example, the Liapunov exponents of an orbit must be calculated nu- 
merically. In this case, you can see that a problem may arise. Namely, a Liapunov 
exponent is a number obtained in the limit t oo, and, in practice, one can only 
compute for a finite amount of time. 

Recall Example 29.2.3. Consider an initial condition 

(x,y) = (e,0). 



and the direction 

6x = (1,0). 



Let 



and compute 



Xt{xo, e) 



1 

t 



log \t(xo, e), 



Xt({e,0),Sx) 



for this example. It should follow from the discussion in the example that for some T 
we have 

Xt((e, 0), <5x) < 0, fG[T, oo). 

Let To{e) be the value of t such that 



XTo(e)((£, 0),<5ir) ^ 0. 

a) Compute To(e) and graph it as a function of e. 

b) What can you conclude from this example concerning the numerical computation 
of Liapunov exponents? 



6. Consider a linear velocity field given by: 



cos4t sin4t-2\ / \ (29 4 1! 

\X2 J ~ \sin4t-\-2 -cos4f J \X2 J ' 

Verify that the solution of (29.4.1) is given by: 

xi{t) — xiqs^ cos 2t — X 2 o^~^ sin 2f , 

a :2 (t) — a:ioe* sin 2t -1- a^ 20 ^ *cos2t. (29.4.2) 

(a) What is the fundamental solution matrix associated with (29.4.1)? 

(b) Compute the Liapunov exponents associated with this fundamental solution 
matrix. 
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Chaos and Strange Attractors 



In this chapter we want to examine what is meant by the term “chaos” 
as applied to deterministic dynamical systems as well as the notion of a 
“strange attractor.” We will begin by giving several definitions of properties 
that should be characteristic of “chaos” and then consider several examples 
that possess one, several, or all of these properties. 

We consider C” (r > 1) autonomous vector fields and maps on K” de- 
noted as follows 



vector field 
map 



X = f{x), 
X g{x). 



(30.0.1) 

(30.0.2) 



We denote the flow generated by (30.0.1) by (j){t,x) and we assume that 
it exists for all t > 0. We assume that A C M" is a compact set invariant 
under 4'{t,x) (resp. g{x)), i.e., 4>{t,A) C A for alH G K (resp. (?"(A) C A 
for all n G Z, except that if g is not invertible, we must take n > 0). Then 
we have the following definitions. 



Definition 30.0.1 (Sensitive Dependence on Initial Conditions) 

The flow <()(t, a;) (resp. g{x)) is said to have sensitive dependence on initial 
conditions on A if there exists e > 0 such that, for any x € A and any 
neighborhood U of x, there exists y € U and t > 0 (resp. n > 0) such that 
\<Kt,x) - (Kt,y) \ > e (resp. \g’^{x) -5”(y)| > e). 

Roughly speaking. Definition 30.0.1 says that for any point x G A, there 
is (at least) one point arbitrarily close to A that diverges from x. Some 
authors require the rate of divergence to be exponential; for reasons to be 
explained later, we will not do so. As we will see in the examples, taken by 
itself sensitive dependence on initial conditions is a fairly common feature 
in many dynamical systems. 



Definition 30.0.2 (Chaotic Invariant Set) A is said to be chaotic if 
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1. 4>{t,x) (resp. g(x)) has sensitive dependence on initial conditions on 

A. 

2. (j>{t,x) (resp. g{x)) is topologically transitive on A. 



Some authors (e.g., Devaney [1986]) add an additional requirement to 
Definition 30.0.2. For further discussions of the definition of chaos see Glas- 
ner and Weiss [1993] and Banks et al. [1992]. 

3. The periodic orbits of (j){t,x) (resp. g{x)) are dense in A. 



We will not explicitly include point 3 as part of the definition of a 
chaotic invariant set, but we will examine its importance and relationship 
to “chaos.” 

We will now consider several examples that exhibit the properties de- 
scribed in these two definitions. 

Example 30.0.1. Consider the following vector field on 

X = ax, X G 

with a > 0. The flow generated by (30.0.3) is given by 

cf)(t, x) = e°'^x. 

From (30.0.4) we conclude the following. 

1. (j>{t,x) has no periodic orbits. 

2. (j>{t, x) is topologically transitive on the noncompact sets (0, oo) and 
(-cx),0). 

3. 0(t, x) has sensitive dependence on initial conditions on since for any 
a;o,a;i G R^, with xo xi, 

\<j){t, Xo) - (j>{t,xi)\ = e“*|xo - xij. 

Hence, the distance between any two points grows (exponentially) in time. 



(30.0.3) 



(30.0.4) 



End of Example 30.0.1 
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Example 30.0.2. Consider the vector field 

. TT 

r = sin — , 
r 

e = r, (r,e)eR+xS\ 



(30.0.5) 



The flow generated by (30.0.5) has a countable infinity of periodic 
by 

(r(t), e{t)) = C, - + SoV n = 1, 2, 3, • • • . 

\n n } 



orbits given 
(30.0.6) 



It is easy to verify that the periodic orbits are stable for n even and unstable for 
n odd. Hence, in a compact region of the phase space, (30.0.5) has a countable 
infinity of unstable periodic orbits. We leave it as an exercise for the reader 
to verify that (30.0.5) may exhibit sensitive dependence on initial conditions 
in (open) annuli bounded by adjacent stable periodic orbits (see Exercise 2). 
However, (30.0.5) is only topologically transitive in the (open) annuli bounded 
by adjacent stable and unstable periodic orbits. 



End of Example 30.0.2 



Example 30.0.3. Consider the vector field on the two torus, x 

6l = LOl, 

02=002, (01,62) GT\ (30.0.7) 



with 



— = irrational. (30.0.8) 

UJ2 



Then, it follows from Chapter 10, Section 10.4a, that the flow generated by 
(30.0.7) is topologically transitive on T^. From (30.0.8) it is easy to see that 
the flow generated by (30.0.7) has no periodic orbits. We leave it as an exercise 
for the reader (see Exercise 3) to show that on the flow generated by (30.0.7) 
does not have sensitive dependence on initial conditions. 



End of Example 30.0.3 



Example 30.0.4. Consider the following integrable twist map 



( ^ 

\2nD.(I)+6 



_ ffi(i,0) 
f 2(1,0) 



(I,6)eR+xS\ (30.0.9) 



and 



-^(J) ^ 0 (twist condition). 



(30.0.10) 
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The iterate of (30.0.9) is easily calculated and is given by 

\2TvnQ{i) + 9) - 



(30.0.11) 



The simple form of (30.0.9) and (30.0.11) enable us to easily verify the following. 

1. Equation (30.0.9) is not topologically transitive, since all orbits remain on 
invariant circles. 

2. The periodic orbits of (30.0.9) are dense in the phase space. This uses the 
twist condition. 

3. Equation (30.0.9) has sensitive dependence on initial conditions due to the 
twist condition. This can be seen as follows. For (Iq, 6q), {Ii, 6i) G R"*" x , 
with Jo yf /i, we have 

\{fi(io,eo) - /r(/i,0i)), if 2 (io,eo) - f2ih,ei))\ 

= |((/o-/i),(27rn(fl(7o)-fl(/i)) + (6'o-6li))|. (30.0.12) 



Therefore, from (30.0.10), fl(/o) — fl(7i) yf 0. Thus, we see from (30.0.12) that 
as n increases, the 9 components of nearby points drift apart. However, the rate 
of separation is not exponential. 



End of Example 30.0.4 



Example 30.0.5. Consider 

cr:S^ — (30.0.13) 



where is the space of bi-infinite sequences of N symbols and a is the shift 
map as described in Chapter 24. Then we have proven the following. 

1. is compact and invariant (see Proposition 24.1.4). 

2. a is topologically transitive, i.e., cr has an orbit that is dense in (see 
Proposition 24.2.2). 

3. (7 has sensitive dependence on initial conditions (see Section 23.5). 

4. (7 has a countable infinity of periodic orbits (see Proposition 24.2.2) that 
are dense in E^ (see Exercise 4). 

Thus, E^ is a chaotic, compact invariant set for cr. From Chapters 25, 26, 
and 27 we know that two-dimensional maps and three-dimensional autonomous 
vector helds may possess compact invariant sets on which the dynamics are topo- 
logically conjugate to (30.0.13). In all of these cases, homoclinic (and possibly 
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heteroclinic) orbits are the underlying mechanism that gives rise to such behav- 
ior. This is important, because this knowledge enables us to develop techniques 
(e.g., Melnikov’s method) that predict (in terms of the system parameters) when 
chaotic dynamics occur in specific dynamical systems. 



End of Example 30.0.5 



We make the following remarks concerning these examples. 

Remark 1. Example 30.0.1 illustrates why we require chaotic invariant sets 
to be compact. 

Remark 2. Example 30.0.2 illustrates why having an infinite number of 
unstable periodic orbits in a compact, invariant region of phase space is 
not by itself a sufficient condition for chaotic dynamics. 

Remark 3. Example 30.0.3 describes a vector field having a compact invari- 
ant set (which is actually, the entire phase space) on which the dynamics 
are topologically transitive, but it does not have sensitive dependence on 
initial conditions. 



Remark 4- Example 30.0.4 describes a two-dimensional integrable map that 
has sensitive dependence on initial conditions and the periodic orbits are 
dense in the phase space, but it is not topologically transitive. 

Thus, taken together. Examples 30.0.1 through 30.0.4 show the impor- 
tance of Definition 30.0.2 being satisfied completely. Example 30.0.5 shows 
how chaotic dynamics can arise in many dynamical systems. However, it 
does not address the question of observability. 



Definition 30.0.3 (Strange Attractor) Suppose A C K” is an attrac- 
tor. Then A is called a strange attractor if it is chaotic. 



Hence, if we want to prove that a dynamical system has a strange at- 
tractor we might proceed as follows. 



Step 1. Find a trapping region, M, in the phase space (see Definition 8.2.2 
in Chapter 8). 

Step 2. Show that A4 contains a chaotic invariant set A. In practice, this 
means showing that inside Ad is a homoclinic orbit (or heteroclinic 
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cycle) which has associated with it an invariant Cantor set on which 
the dynamics are topologically conjugate to a full shift on N symbols 
(recall Chapters 26 and 27). 

Step 3. Then, from Definition 8.2.1 in Chapter 8, 



t>o 




(30.0.14) 



is an attracting set. Moreover, A C A (see Exercise 5) so that A 
contains a mechanism that gives rise to sensitive dependence on initial 
conditions; in order to conclude that M is a strange attractor we need 
only demonstrate the following. 

1. The sensitive dependence on initial conditions on A extends to 

2. M is topologically transitive. 



Hence, in just three steps we can show that a dynamical system possesses 
a strange attractor. In this book we have developed techniques and seen 
examples of how to carry out Steps 1 and 2. However, the third step is the 
killer, namely, showing that A is topologically transitive. This is because a 
single, stable orbit in A will destroy topological transitivity and, in Chap- 
ter 32, we will see that periodic sinks are always associated with quadratic 
homoclinic tangencies. Moreover, as a result of Newhouse’s work, at least 
for two-dimensional dissipative maps, these homoclinic tangencies are per- 
sistent in the sense that if we destroy a particular tangency we will create 
another elsewhere in the homoclinic tangle. This is largely the reason why 
there is yet to be an analytical proof of the existence of a strange attractor 
for the periodically forced, damped Duffing oscillator, despite an enormous 
amount of numerical evidence. However, Wang and Young [2002] give a 
result that proves the existence of a strange attractor for “sufficently large 
damping” . 

At present, there exist rigorous results concerning strange attractors (by 
our Definition 30.0.3) in the following areas. 



1. One- Dimensional Non-Invertible Maps. For maps such as 

X I— >■ /icc(l — x) 



X ^ x^ — pL 



or 
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with /i a parameter there now exists a fairly complete existence theory 
for strange attractors. The reader should consult Jakobsen [1981], Mi- 
siurewicz [1981], Johnson [1987], Guckenheimer and Johnson [1990], 
and de Melo and van Strien [1993]. 

2. Hyperbolic Attractors of Two-Dimensional Maps. Plykin [1974], Ne- 
mytskii and Stepanov [1989] and Newhouse [1980] have constructed 
examples of hyperbolic attracting sets which satisfy Definition 30.0.2. 
These examples are somewhat artificial in the sense that one would 
not expect them to arise in Poincare maps of ordinary differential 
equations that arise in typical applications. 



3. Lorenz-Like Systems. The topology associated with the Lorenz equa- 
tions (see Sparrow [1982]) avoids many of the problems associated 
with Newhouse sinks. Consequently, in the past few years there has 
been much progress in proving that the Lorenz equations (along with 
slightly modified versions of the Lorenz equations) possess a strange 
attractor. The reader is referred to Sinai and Vul [1981], Afraimovich, 
Bykov, and Silnikov [1983], Rychlik [1990], and Robinson [1989]. Re- 
cently, an elegant computer assisted proof of the existence of a strange 
attractor for the Lorenz equations was given by Tucker [1999]; see also 
Morales et al. [1998] and the popular articles of Stewart [2000] and 
Viana [2000]. 

4. The Henon Map. The Henon map is defined to be 



y, 

y -ex + p,-y^, 



(30.0.15) 



where e and pL are parameters. Over the past ten years a large amount 
of numerical evidence has suggested the existence of a strange attrac- 
tor in this map. Recently, Benedicks and Carleson [1991] have proven 
that, for e small, (30.0.15) does indeed possess a strange attractor. 

5. Recent Results on Strange Attractors. For recent work on strange 
attractors see Palis and Takens [1993], Mora and Viana [1993], Diaz 
et al. [1996], Naudot [1996], and Turaev and Silnikov [1998]. 

6. Strange Attractors for Nonautonomous Vector Fields. If the nonau- 
tonomous vector field is time-periodic, then its study can be reduced 
to an associated Poincare map (Chapter 10) and strange attractor 
results for maps can be applied. A very thorough discussion of this 
can be found in Wang and Young [2002]. 

If the vector field is not time-periodic, then many new issues arise. 
The notion of an attractor for general nonautonomous systems was 
described in Chapter 8, Section 8.4. In addition to the notion of at- 
tractors for such vector fields, a definition and characterization of 
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chaos must be given. This has been considered in Scheurle [1986], 
Wiggins [1988], [1999], Stoffer [1988a,b], Meyer and Sell [1989], and 
Lerman and Silnikov [1992]. With notions of “attraction” and “chaos” 
in hand for nonautonomous vector fields with arbitrary time depen- 
dence one should be able to develop a notion of “strange attractor” . 
Nevertheless, this is a subject which has not been developed, both 
from the point of view of rigorous mathematical results, and specific 
examples. 



Thus, the “strange attractor problem” is still far from being solved in 
general. In particular, there is a need for thoroughly studied examples in 
higher dimensions, for vector fields in dimensions larger than three and 
for maps of dimension larger than two. A variety of examples of systems 
undergoing (rigorously proven) chaotic behavior in high dimensions can be 
found; however, the attractive nature of the chaos in these examples has 
not been studied. 



If a dynamical system possesses a chaotic invariant set, then an obvious 
question arises: namely, how is the chaos manifested in terms of “random” 
or “unpredictable” behavior of the system? The answer to this question 
depends on the geometry of the construction of the chaotic invariant set 
and, thus, the answer varies from problem to problem (as we should expect). 
Let us consider an example, our old friend the periodically forced, damped 
Duffing oscillator. 



This system is given by 
x = y, 

y = X — + e{—Sy + ^ cos cot). 



(30.0.16) 



We know from Chapter 28 that, for £ sufficiently small and S < 
^ 3-kw sec^7ro;/2) ^ Poincare map associated with (30.0.16) possesses 
transverse homoclinic orbits to a hyperbolic fixed point. Theorem 26.0.5 
implies that (30.0.16) has chaotic dynamics. However, we want to interpret 
this chaos specifically in terms of the dynamics of (30.0.16). We will do 
this by constructing the chaotic invariant set geometrically and describ- 
ing the associated symbolic dynamics geometrically. Our discussion will be 
heuristic, but, at this stage, the reader should easily be able to supply the 
necessary rigor (see Exercise 7). 

Consider the two “horizontal” strips labeled and i7_ in Figure 30.0.1. 

Under the Poincare map, denoted P, and are mapped over them- 
selves in iterates as shown heuristically in Figure 30.0.1. It should be clear 
that the horizontal (resp. vertical) boundaries of and i7_ map to hor- 
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izontal (resp. vertical) boundaries of P^{H+) and respectively. 

Thus, one can show that Assumptions 1 and 3 of Chapter 25 hold (see Ex- 
ercise 7). Therefore, i7+Ui7_ contains an invariant Cantor set. A, on which 
the dynamics are topologically conjugate to a full shift on two symbols. A 
point starting in An makes a circuit around the right-hand homoclinic 
tangle before returning to A. A point starting in A fl makes a circuit 
around the left-hand homoclinic tangle before returning to A. A symbol 
sequence such as 



thus corresponds to an initial condition starting in P [_ , going to i7_|_ under 
P'^ (hence making a circuit around the left-hand homoclinic tangle), and 
then going back to i7_ under P^ (hence making a circuit around the right- 
hand homoclinic tangle), etc. The geometrical meaning of “chaos” should 
be clear for this system, but the reader should do Exercise 7. 

We end this section with some final remarks. 

Remark 1. The dynamics of the full shift on N symbols best describes 
what we mean by the term “chaos” as applied to deterministic dynamical 
systems. The system is purely deterministic; however, the dynamics are 
such that our inability to precisely specify the initial conditions results in 
behavior that appears random or unpredictable. 



Remark 2. We did not include in our definition of the chaotic invariant 
set (Definition 30.0.2) the requirement of density of periodic points. If the 
chaotic invariant set is hyperbolic, then, by the shadowing lemma (see, e.g., 
Shub [1987]), it follows immediately that the periodic points are dense. 
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Moreover, Grebogi et al. [1985] have obtained numerical evidence for the 
existence of chaotic attractors in maps of the A'^-torus which have the prop- 
erty that orbits in the attractor densely cover the fV-torus. 

Remark 3. In our definition of sensitive dependence on initial conditions 
(Definition 30.0.1) we did not require the separation rate to be exponential. 
This is because it appears now that the strange attractors observed in 
numerical experiments of typical dynamical systems arising in applications 
will not, in general, be hyperbolic. Hence, one should expect parts of the 
attractor to exhibit nonexponential contraction or expansion rates. This is 
an area where new analytical techniques need to be developed. 

Remark 4- Positive Liapunov exponents have been a standard criterion for 
deciding when a dynamical system is “chaotic” over the past few years. 
Examples 29.2.1 and 29.2.3 show that this criterion should be interpreted 
with caution; see also Exercise 5 following Chapter 29. 

Remark 5. In an interesting series of papers. Brown and Chua [1996a, b], 
[1998] describe relationships between different characterizations of chaos in 
great detail. 



30.1 Exercises 



1. Can a C’’ map or flow depend on initial conditions in a manner and also exhibit 
sensitive dependence on initial conditions? Explain. 

2. Recall Example 30.0.2. Do all or only some orbits in the open annuli bounded by 
adjacent stable periodic orbits exhibit sensitive dependence on initial conditions? 

3. Recall Example 30.0.3. Show that the flow generated by (30.0.7) is topologically tran- 
sitive on T^. 

4. Show that for the dynamical system 

the periodic orbits are dense in 

5. Let 

X I— g{x ) , X G , 

be a (r > 1) map. Suppose Ai C is a trapping region with A C Ai a chaotic 
invariant set. Then defining 

.4= fl s"(X1). 

n>0 

show that 

A C 

6. Very often one hears the phrase, 
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A dynamical system is chaotic if it has one positive Liapunov exponent. 

Discuss what this phrase means in light of the discussion in Sections 29 and 30. Con- 
sider both dissipative and nondissipative systems. 

7. Recall the discussion of chaos in the phase space of the damped, periodically forced 
Duffing oscillator at the end of Section 30. The goal of this exercise is to make the 
heuristic arguments given in that discussion rigorous. 

a) Draw the homoclinic tangle correctly for the Poincare map on a given cross- 
section {say E^). You may want to use a computer. 

b) Find candidates for two ///j,-horizontal strips, denoted Hq and ifi, which map 
over themselves in /^^;-vertical strips under some iterate of the Poincare map 
so that Assumptions 1 and 3 of Chapter 25 are satisfied. Choose the horizon- 
tal strips so that the relationship between the motion in phase space and the 
dynamics on the invariant set is as described in Chapter 30. 

c) Describe the relationship between the number of iterates needed to form the 
chaotic invariant set and the parameters 7 — <5 — a; (see Holmes and Marsden 
[1982] for help). 

8 . Often one hears the phrase, 

For diffeomorphisms of dimension two and larger and for vector fields 
of dimension three and larger, homoclinic orbits produce chaos. 

Is this statement generally true? Give a complete discussion with examples. 

9. For Example 30.0.4 prove that the periodic orbits are dense. 
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Hyperbolic Invariant Sets: A 
Chaotic Saddle 



We now want to show that the invariant Cantor set A constructed in Chap- 
ter 25 has a very special structure. In particular, it is an example of a 
hyperbolic invariant set. Hyperbolic invariant sets are examples of chaotic 
saddles, a terminology that has developed over the past 10 years. We will 
comment more on this at the end of this chapter. 

The notion of hyperbolicity played a central role in the development of dy- 
namical systems theory. We will begin this chapter by giving the necessary 
definitions and constructions in the context of two dimensional diffeomor- 
phisms (we follow Moser [1973]). This has the advantage of allowing us 
to give rather complete proofs of results using fairly simple mathematics 
(the mathematical tools are simple, the reasoning and constructions with 
those tools is more complicated). Afterwards we will discuss the general 
n-dimensional result. 



31.1 Hyperbolicity of the Invariant Cantor Set A 
Constructed in Chapter 25 



We begin with the following definition. 



Definition 31.1.1 Let /: — >■ be a C’’ (r > 1) dijfeomorphism and 

let A C be a compact set that is invariant under f. Then we say that A 
is a hyperbolic invariant set if 

1. At each point zq G A there exists a pair of lines, and that 
are invariant under Df{zo) in the sense that 



and 
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2. There exists a constant 0 < A < 1 such that if 

Go = G then |D/(zo)GoI < ^IGol 

and if 

Go = (Go:%o) G l^/"^Go)Gol < •^IGol- 

where |Czol = \/(Go)^ + {VzoY- 

3. El^ and Ef^ vary continuously with zq G A. 

We make the following remarks concerning this definition. 

Remark 1. It should be clear that hyperbolic fixed points and hyperbolic 
periodic orbits are examples of hyperbolic invariant sets. 

Remark 2. E^ = UzogA-^zo = UzogA-^G called the invariant 

stable and unstable line bundles over A, respectively. 

We now state the theorem that A is a hyperbolic invariant set. 



Theorem 31.1.2 (Moser [1973]) Consider the O'" (r > 1) diffeo- 
morphism f and its invariant set A described in Theorem 25.2.1. Let 
A = sup^(det £>/). Then if 

A,A”i 

where 0 < /i < 1 — yivfJ'h, A is a hyperbolic invariant set. 



Proof: We begin by constructing the unstable invariant line bundle over A, 
First, we want to recall some important points from the 
hypotheses of Theorem 25.2.1. 



i) 



^ZQ — {(Go:''?zo) G R I ICzol A TvlVzol}- (31.1.1) 



ii) 



DfiSif,) C 5G 



(31.1.2) 



iii) 



For i^zo,Vzo) G 5“, Df{zo){fzo,Vzo) = (G(^o): ^/Co)) G S^(zo)^ 
have 

\Vf{zo)\>-\Vzo\, (31.1.3) 



where 0 < /x < 1 — pivti-h. 
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The construction of will be by the contraction mapping principle. We 
define 

C\ = {continuous line bundles over A contained in 
“Points” in C\ will be denoted by 

£X(/3(^o))= U (31-1-4) 

zqGA 

where 

^a(zo) ~ {(?zo)^zo) S I ?zo = Ct(zo)?7zo}, 

G K" l?zo = f3{zo)Vzo}, (31. 1.5) 

with a{zo), I3{z(i) continuous functions on A and 

sup |a(2:o)| < 

zoSA 

sup |/3(zo)l < ^J-v■ 

zqGA 

As notation for a line in a line bundle, say L\{a{z(i)), at a point 2:0 C A we 
have 

C\ is a complete metric space with metric defined by 

\\C\{a{zo)) - C\{l3{zo))\\ = sup \a{zo) - (3 {,zq)\. (31.1.6) 

zoSA 

From (31.1.6), the geometrical meaning of the continuity of a line bundle 
should be clear. 

We define a map on C\ as follows. For any C\{a{zo)) G C\ we have 

(F(£X(«(^o)))),„ = 7?/(/-i(zo))A“(/-i(zo))- (31.1.7) 

From the fact that Df(S^) C 5y, it follows that 

F{Cl) c Cl. 



(31.1.8) 
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We now show that F is a contraction map. Choose C\{a{zo)), C\{j3{zo)) G 
then we must show that 

\\F{Cl{a{z^))) - F(£X(/3(^o)))|| < k\\Cl{a{z^)) - a(/3(^o))||, (31.1.9) 

where 0 < fc < 1. 

From (31.1.8), 

F{Cl{a{z,)))=Ll{a*{z^))€Ll, 

F(£X(/3(^o)))=/:X(r(^o))ea, 

and we must compute a*{zo) and /3*{zo) in order to verify (31.1.9). 

Let us denote, for simplicity of notation, 

D/.(“ J), (31.1.10) 

where, of course, a, b, c, and d are the appropriate partial derivatives of 
/ = {fi, f 2 ) and are therefore functions of zq. However, to carry this along 
in the formulae would result in very cumbersome expressions. Hence, we 
remind the reader to think of the partial derivatives a, b, c, and d as being 
evaluated at the same point as Df{-) even though we will not explicitly 
display this dependence. 

Using (31.1.10), we have 

a?zo kTjzQ = ^f(zo)j 

C^zo + dVzo=Vf{zo)- (31.1.11) 

Consider an arbitrary line 

L^izo) = {iU,Vzo) G R" l?.o = ^{zo)Vzo} G £l{a{zo))-, (31.1.12) 

then 

F/(/-i(^o))L“(y-i(,„)) ^ (31.1.13) 

and, using (31.1.11) and (31.1.12), we have 
_ f aa{f-^zo)) + b\ _ 

Vca(/-H^o)) + dj ^ (31.1.14) 
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Thus, 



a*{zo) 



aa(f ^(zo)) + b 

ca(/-i(zo)) + d 



is also continuous over A. We have thus shown that 
F{Cl{a{zo)))=Cl{a*{zo)), 



where 



a*{zo) 



actjf '^{zo)) + b 
ca(/-i(zo)) + d' 



Similarly, 

F{Cl{f3{zo))) = Clif3*izo)), 



where 



P*{zo) 



afd{f \zp)) + b 

c/3(/"H^o)) + d' 



Now, using (31.1.6), we have 



\\F{jri{a{zo))) - F{jri{(3{zo))\\ = sup \a*{zo) - P*{zo)\. 

Zq^A 



Using (31.1.15) and (31.1.16), we obtain 



\a*{zo) - (3*{zo)\ < A 



Hf-Hzo)) - /3{f-Hzo))\ 
|ca(/-i(zo)) + d\ |c/3(/-i(zo)) + d\ ' 



From (31.1.3) and (31.1.11) we have 



\Vf~H.o)\ 



F-‘«' +d 

df-Hzo) 



> 



1 



and, therefore, since C/-i(zo) ~ *^(/ ^(^o))''7/~i(zo)) have 
\ca{f~'^{zo)) + d\, \c(3{f~'^{zo)) + d\ > - 



where 0 < /t < 1 — 



(31.1.15) 



(31.1.16) 



(31.1.17) 



(31.1.18) 



(31.1.19) 



(31.1.20) 



Combining (31.1.18) and (31.1.20) gives 
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|a*(zo) - r U)| < M"A|a(/-i(zo)) - P{r\zo))\- (31.1.21) 

Note that since A is invariant under /, we have 

sup \a{f~^{zo)) - P(f~^{zo))\ = sup \a{zo) - (i{zo)\. (31.1.22) 

Taking the supremum of (31.1.21) over zq G A and using (31.1.22), (31.1.6), 
and (31.1.17) gives 

||F(£X(a(^o))) -F(£X(/3 (^o)))|| 

< m"A||£X(«(^o)) - Cl{f}{zo))\\-, (31.1.23) 

thus A’ is a contraction map provided 

Ai^A < 1. (31.1.24) 



Therefore, by the contraction mapping principle, F has a unique, continu- 
ous fixed point. We denote this fixed point by 

(31.1.25) 



Thus we have 

F{E^) = 



or, from (31.1.13), 

(F(if“)),„ = Z?/(/-i(zo))i^;-i(,„) = El, 

which is what we wanted to construct. The construction of the stable line 
bundle over A is virtually identical and we leave it as an exercise for the 
reader. 

This shows that A satisfies Part 1 of Definition 31.1.1. Part 3 follows 
from continuity of the fixed point of F (by the contraction mapping prin- 
ciple). Continuity is measured with respect to the metric (31.1.6) so the 
geometrical meaning should be clear. Part 2 of Definition 31.1.1, expansion 
and contraction rates, is a trivial consequence of Assumption 3 that we 
leave as an exercise for the reader. □ 
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31.1a Stable and Unstable Manifolds of the 
Hyperbolic Invariant Set 



Theorem 3 1. 1. 2 gives us information on the linearization of / at each point 
of A, but we can also obtain information on / itself. Recall from the proof 
of Theorem 25.2.1 that 

A = A_oo O Aqo j 

with 

A-oo = U_i...s_fc... ) 

S-iGS 

Aoo = Uso 'SIc ' > 

BiGS 

where, for each infinite sequence of elements of S, U.i. and 
are /Xt,-vertical and /ift,-horizontal curves, respectively, with 0 < 

Thus, for any zq € A, there exists a unique /r„-vertical curve in A_oo, 

such that 



C'’ (r > 1) diffeo- 
Theorem 25.2.1. Let 



U_i...s_fc...) and a unique /ift,-horizontal curve in Aq 

Zq = ls_,...s_t... 0 Hg ...g..... 



Moreover, we can prove the following theorem. 



Theorem 31.1.3 (Moser [1973]) Consider the 
morphism f and its invariant set A described in 
A = supA(det H/). Then if 



0 < 



M < min f v1^, ^ 



the curves in A_oo and Aoo are curves whose tangents at points in A 
coincide with and if®, respectively. 



Proof: The proof follows the same ideas as Theorem 31.1.2; in Exercise 2 
we outline the steps one must complete to establish the theorem. □ 



In some sense we can think of these curves as defining the stable and 
unstable manifolds of points in A. The details of this are worked out in the 
exercises. But first we must first give some definitions. We will give these 
definitions in the general n-dimensional setting. 
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31.2 Hyperbolic Invariant Sets in 



Consider a C’’ (r > 1) diffeomorphism 

/ : K” ^ K”, 



and let A be a compact invariant set for /. 

Definition 31.2.1 (Hyperbolic Invariant Set) A is said to he o hyper- 
bolic invariant set if for each p G A there is a splitting 

M” = £;; + e ;, 

which varies continuously with p G A, and constants C>0, 0<A<1 
such that 

1. (Invariance of the Splitting^ 

Df{p)E; = 

Df{p)E; = 

2. (^Contraction and Expansion j 

\Dr{p)v\<CX^\v\, yvGE;,pGA, 

\Df-'-{p)v\<CX'^\v\, yvGE;,pGA. 



For any point p G A, s > 0, the stable and unstable sets of p of size e are 
defined as follows 



Wfip) = {/ € A I |/”(p) - /”(p')l < £ for n > 0}, 
W:^(p) = {p'gA\ |/-"(p) - /-"(p')| < £ forn > 0}. 



From Chapter 3, we have seen that if p is a hyperbolic fixed point the 
following hold. 



1. For e sufficiently small, Wf(p) is a C’’ manifold tangent to E^ at p 
and having the same dimension as E®. Wf{p) is called the local stable 
manifold of p. 
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2. The stable manifold of p is defined as follows 

OO 

w%p) = u r^iw^ip)). 

n—O 



Similar statements hold for W^{p). 

The invariant manifold theorem for hyperbolic invariant sets (see Hirsch, 
Pugh, and Shub [1977]) tells us that a similar structure holds for each point 
in A. 



Theorem 31.2.2 Let K he a hyperbolic invariant set of o C’’ (r > 1) 
dijfeomorphism f. Then, for £ > 0 sufficiently small and for each point 
p € A, the following hold. 



i) Wf{p) and Wf{p) are C’’ manifolds tangent to Ef, and E^, respec- 
tively, atp and having the same dimension as Ef, and E^, respectively. 

ii) There are constants C > 0, 0 < A < 1, such that if p' € Wf(p), then 

\r{p)~ r{p')\<CX^\p-p'\, for n >0 

and if p' G Wf{p), then 

|/-"(p)-/-"(p')| <CA"b-p'| for n>0. 



iii) 

nwfip)) c wfifip)), 
r\wf{p)) c w:if-\p)). 

iv) Wf{p) and Wf{p) vary continuously with p. 



Proof: See Hirsch, Pugh, and Shub [1977]. 



□ 



With Theorem 31.2.2 in hand, one can then define the global stable and 
unstable manifolds of any point p G A as follows 

OO 

w^p) = u .r-iwfirip))), 

n— 0 

OO 

w-{p) = u .r(w;(/-"(p))). 

n—O 
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We refer the reader to the exercises for more detailed studies of the issues 
raised here in the context of the hyperbolic invariant set constructed in 
Theorem 31.1.2 and we end this section with some final remarks. 

Remark 1: Structural Stability. Hyperbolic invariant sets (like hyperbolic 
fixed points and periodic orbits) are structurally stable; see Hirsch, Pugh, 
and Shub [1977]. 

Remark 2: History. The reason for discussing the concept of hyperbolic 
invariant sets is that they have played a central role in the development 
of modern dynamical systems theory. The definitions of Anosov diffemor- 
phisms and Axiom A diffeomorphisms rely crucially on hyperbolicity, and 
their study has been important for the development of many concepts and 
theorems in dynamical systems theory. For example, the ideas of Markov 
partitions, pseudo orbits, shadowing, etc., were all developed initially in 
these contexts and all utilize crucially the notion of a hyperbolic invariant 
set^. 

Indeed, the existence of a hyperbolic invariant set is often assumed a 
priori. This has caused the applied scientist great difficulty since, in order 
to utilize many of the techniques or theorems of dynamical systems theory, 
he or she must first show that the system under study possesses a hyperbolic 
invariant set. The techniques developed here, specifically the preservation of 
sector bundles, allow one to explicitly construct hyperbolic invariant sets. 
For more information on the consequences and utilization of hyperbolic 
invariant sets see Smale [1967], Nitecki [1971], [1978], Conley [1978], Palis 
and de Melo [1982], Shub [1987], Franks [1982], Palis and Takens [1993], 
and Katok and Hasselblatt [1995]. 

Remark 3: Chaotic Saddles. Hyperbolic invariant sets were studied in great 
detail in the 1960’s; well before the term “chaos” was popularized in the 
context of dynamics. Hyperbolic invariant sets are examples of chaotic sad- 
dles; a notion around which many papers have been published over the past 
10 years. See, for example, Lai et al. [1993], Thompson et al. [1994], Ashwin 
et al. [1996], Dhamala and Lai [1999], Kapitaniak et al. [1999], Kapitaniak 
[2001], and Robert et. al [1998], [2000]. Most of these works are numerical 
in nature and fail to recognize the related pioneering work of the 60’s and 
70’s. Much can be learned about “chaotic saddles” by studying the semi- 
nal works of Bowen [1970a,b], [1972], [1973], [1975a] and Newhouse [1972]. 



^Anosov diffeomorphisms are those where the entire phase space is a hyper- 
bolic invariant set for the diffeomorphism. Axiom A diffeomorphisms are those 
having the property that their set of nonwandering points (cf. Definition 8.1.5) 
is hyperbolic and the the set of nonwandering points is the closure of the set of 
periodic orbits. 
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Nevertheless, the numerical studies of the last 10 years on chaotic saddles 
do go beyond the previous work in that they consider the more complex 
issues of loss of hyperbolic! ty and bifurcation. This is an important area of 
research for which there are not many theorems, at the moment. 



31.2a Sector Bundles for Maps on M"- 

In this section we state the higher dimensional analog of Theorem 31.1.2. 
As above, let /: K" — >■ K" be a C"" (r > 1) diffeomorphism and let A be 
a closed set which is invariant under /. Let M” = ifp © be a splitting 
of M" for p G A and let p{p) be a positive real valued function defined on 
A. We define the fj,{p) sector, denoted S'^(p), as follows 

“ { i^pjVp) ^ Ep I ICpI ^ m(p) I^pI } (31.2.1) 



and we define the complementary sector, as follows 

^;.(p)=®”-^Mp)- (31-2.2) 

Then we have the following theorem. 



Theorem 31.2.3 Let /: K” — >■ M” be a (r > 1) diffeomorphism 
and let A C K” he a closed set which is invariant under f. Then A is a 
hyperbolic invariant set if and only if there exists a splitting K" = 
for each p G A, an integer n > 0, constants C > 0, 0<A<1 with 
CA" < 1, and a real valued function /r: A — >■ R+ such that the following 
conditions are satisfied: 



1 ) 



sup {max(/r(p), ^(p) ^)} < oo. 
pSA 



2) For each p G A, we have 



(31.2.3) 



a) Drip) ■ C 5^(/n(p)) 

b) */epG^;(p), \Dnp)^p\<cx^\^p\ 

C) tff,p&S^(pp |Ar”(p)?p| <CA"|ep|. (31.2.4) 



The proof of this theorem can be found in Newhouse and Palis [1973]. 
Theorem 31.2.3 tells us that in order to establish hyperbolicity for A we 
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need only find bundles of sectors •S' = U *5" = U 'S'u(p). such that 

peA peA 

Df maps S into S while expanding each vector in S and Df maps S' into 
S' while contracting each vector in S' . 

Now regarding Theorem 25.2.1 from Chapter 25, the reader should notice 
that if A1 and A3 hold then the invariant set A is a hyperbolic invariant 
set since the conditions of A3 are weakened versions of the necessary and 
sufficient conditions for a set to be hyperbolic given in Theorem 31.2.3. 



31.3 A Consequence of Hyperbolicity: The 
Shadowing Lemma 



We will end this section by discussing the shadowing property associated 
with hyperbolic invariant sets. Two recent monographs on shadowing are 
Lani-Wayda [1995] and Pilyugin [1999]. We follow the proof of Robinson 
[1977]. First we need some defintions. 



Definition 31.3.1 (e pseudo orbit) An infinite e pseudo orbit is a dou- 
bly infinite sequence of points 

{xi, I i G 2} 



such that 

|/(x3 - Xi+i\ < e. 



Definition 31.3.2 Suppose U is a neighborhood of a hyperbolic invariant 
set A where the hyperbolic splitting holds. Then we say that U is a set where 
f is hyperbolic. 

Lemma 31.3.3 (Shadowing Lemma) Let U G K" be a region where f 
is hyperbolic. Given <5 > 0 there exists e > 0 such that if {xi, \ i G 2} is an 
e pseudo orbit in U then there exists a unique y such that 

\f{y)-Xi\ < S 

for all i € ^ , i.e. every e pseudo orbit is 6 shadowed by an orbit of f. 

Proof: Consider a neighborhood [/ of A where / is hyperbolic. Then the 
invariant splitting of K" over A (i.e., K" = A® + Ef, x G A) extends to an 
“almost invariant” splitting” over U. 
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For each x G U consider the disks of radius 6/2 in E/. and EJ/. The 
cartesian product of these disks gives a neighborhood B{x) of x. 

For 6 sufficiently small, / is close to Df in B{x). Thus, in B{x) the 
contraction and expansion properties of / are close to those of Df. 

If e is small enough, and {xi \ i G 2} is an e pseudo orbit in U, then 
f{B{xj-i)) stretches across B{xj) in the unstable directions and is con- 
tracted in the stable directions. Then / (/ (B{xj-i)) fl B{xj-i)) fl B{xj) is 
an even thinner strip that crosses B{xj) in the unstable directions. Con- 
tinuing this construction: 

f]r{B{x,.r.))^D-{xj,{D}) 

n>0 



is a disk that stretches across B{xj) in the unstable direction. We call it the 
unstable disk at Xj for the e pseudo orbit. It does not necessarily contain 
X,- and it is the same dimension as Ef . 

By construction, a point y G , {xi}) if and only if y G /” (B{xj-n)) 

for all n > 0, i.e., f~^{y) G B(xj_„) for all n > 0. This gives the ’’backward 
half’ of the shadowing orbit. Next we construct the ’’forward half’. 

From the stable manifold theorem it follows that D'^{xj,{xi}) is 
and almost tangent to Ef.. Moreover, / expands Z?“(xj,{xj}) across 
I?“(xj+i, {xi}). Thus f~^ : D“(xj+i, {xi}) — >■ T>“(xj,{xi}) is a uniform 
contraction. Hence, the contraction mapping principle implies that 

fl {/-"(I9“(x„,{x00)}, 

n>0 



is the unique point y such that y G /“”(H(x„)) for all n, i.e. such that 
f^{y) G B{xn) for all n. Hence, the orbit of y 6 shadows the e pseudo 
orbit . □ 



31.3a Applications of the Shadowing Lemma 

Applications of the shadowing lemma present themselves immediately. In 
many settings our dynamical systems are not “perfect” . The imperfection 
can arise from modelling errors or uncertainties. They could arise in the 
process of numerical simulation of the trajectories of a dynamical system 
through round-off errors. In each of these situations we would like to know 
that the trajectories of our “approximate” dynamical system are “close” 
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to a trajectory of the “real” dynamical system. The shadowing lemma 
provides a compelling framework for thinking about such issues. Results 
along these lines can be found in Nusse and Yorke [1988], Grebogi et al. 
[1990], Coomes et al. [1993], [1994a,b], [1995a, b], [1997], Coomes [1997], 
Palmer [1996], Sauer et al. [1997], Stoffer and Palmer [1999], van Vleck 
[1995], [2000], Corless [1992], Fryska and Zohdy [1992], Sauer and Yorke 
[1991], and Palmore and McCauley [1987]. 

However, it must be understood that shadowing only rigorously holds 
in the setting of hyperbolic dynamics-a setting the is extremely rare in 
applications. Numerical simulations on the effect of the loss of hyperbolicity 
on shadowing can be found in Dawson et al. [1994]. A rigorous result has 
recently been obtained by Bonatti et al. [2000]. 

Applications of shadowing in partial differential equations can be found 
in Larsson and Sanz-Serna [1999], Ostermann and Palencia [2000], An- 
genent [1987], and Chow et al. [1989]. 

Applications of shadowing in perturbation theory can be found in Mur- 
dock [1990], [1995], [1996], and Lin [1989], [1996]. 

Applications of shadowing in fluid mechanics can be found in Klapper 
[1992], [1993] and Ghosh et al. [1998]. 

An application of shadowing to predictability can be found in Pearson 

[ 2001 ]. 

A shadowing lemma for random maps is developed in Chow and van 
Vleck [1992/93]. 

Alternate proofs of the shadowing lemma can be found in Meyer and Sell 
[1987], Feckan [1991], and Hadeler [1996]. 

Other interesting references concerning shadowing are Chu and Koo 
[1996], Henry [1994], Kruger and Troubetzkoy [1992], Blank [1991], Stein- 
lein and Walther [1991], and Palmer [1988]. 



31.4 Exercises 

1. Recall from the proof of Theorem 31.1.2 that 

C\ — I continuous line bundles over A contained in 
with typical “points” in denoted by 

C\(a{zo)) = IJ 
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■Ca(/1(^o)) = U 



where 



-^c(2o) ~ { ^ ' 

-^/3(2q) ~ { ^ ' 



I izo = “(zoi^J^o}’ 
I ~ /l(zo)^zo } ■ 



We defined a metric on C^i as follows 



II^a(“(^o)) - -CX(/3(2o))II = sup |a(zo) - /3(zo)|. 

^o£A 



(31.4.1) 



a) Prove that (31.4.1) is indeed a metric. 

b) Prove that is a complete metric space with the metric (31.4.1). 

c) Recall the unstable invariant line bundle constructed in Theorem 31.1.2 that we 
denoted 

£“= U K- 

zq€A 

For = (^zq.Vzq) e prove that 

\Df-\zoK.„ \ < AIC.ol 

where 0 < A < 1. 



2. Prove Theorem 31.1.3. Hints: 1) show that the map F defined on in (31.1.7) can 
be extended to a map on line bundles over A_oo (the /^^;-vertical curves) and not just 

A. 

2) Next, let the graph of a: = v{y) be a /^t;-vertical curve in A_oo and let zq — {xQ,yo) 
be a point on that curve. Let Tzq denote the set of lines ^ = Oi{zQ)r} with 



(x{zq) — lim 



vjyn) - viVn) 

Vn - y'n 



where yn 7 ^ y'^ are two sequences approaching yo for which this limit exists. Show that 
l^(-^o)l ^ and that the set of (x{zq), zq fixed, satisfying this is closed. 

3) Let 

cu{Tzq) — maxo;( 2 o) — mina( 2 o) < 

where the maximum and minimum is taken over the set defined above. Show that if 
co(Tzq) — 0 , then the curve has a derivative at zq and that, since the two sequences, 
yn and y!^, were arbitrary, the derivative is continuous. 

4) Finally, we will be through if, from Step 3, we show that uj{Tzq) — 0. This is done as 

follows. First show that F{Tzq) = Tzq, zq G A — oo, by using the mean value theorem. 
Next use the contraction property to show that uj{Tzq) — io{F{TzQ)) < ^lo{Tzq) and 
from this conclude that co{Tzq) = 0. Does it follow that agrees with the tangent 

to this curve? 

If you need help see Moser [1973]. 

3. Consider the invariant set A constructed in Theorem 25.2.1. Using Theorems 31.1.2, 
31.1.3, and 31.2.2 describe in detail the stable and unstable manifolds of A. 

4. Horseshoes are Structurally Stable. Suppose that a map f:D—¥ satisfies the 
hypothesis of Theorem 25.2.1. Then it possesses an invariant Cantor set A. Show that, 
for e sufficiently small, the map f sg (with g C’^, t > 1, on D) also possesses an 

invariant Cantor set Ag. Moreover, show that Ag can be constructed so that {f-\-sg)\ 

lAg 

is topologically conjugate to /I . 
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Long Period Sinks in 
Dissipative Systems and 
Elliptic Islands in Conservative 
Systems 



Long period sinks in dissipative systems and elliptic islands in conservative 
systems are the “demons” that thwart proofs of the existence of strange 
attractors in dissipative systems and sets of positive measure on which the 
dynamics is chaotic in conservative systems. In this chapter we describe 
some aspects of these phenomena. 



32.1 Homoclinic Bifurcations: Cascades of 
Period-Doubling and Saddle-Node 
Bifurcations 



In Chapter 26 we described some aspects of the complex dynamics asso- 
ciated with a transverse homoclinic orbit to a hyperbolic fixed point. In 
particular, the map possessed a countable infinity of unstable periodic or- 
bits of all periods. We now want to consider the situation of a bifurcation 
to transverse homoclinic orbits. Specifically, we consider a one-parameter 
family of diffeomorphisms of the plane having a hyperbolic periodic orbit 
(which, without loss of generality, we can assume is a fixed point). Refer- 
ring to the parameter as fi, suppose, for /i > /tq, the stable and unstable 
manifolds of the fixed point do not intersect and, for fj, < /iq, they intersect 
transversely (the reader might peek ahead to Figure 32.1.2). Hence, a nat- 
ural question arises; as we go from /x > /io (i.e., no horseshoe) to /i < /xq 
( i.e., many horseshoes), how are all the unstable periodic orbits created? 
We will see that under certain conditions the creation of the complicated 
dynamics associated with a transverse homoclinic orbit to a hyperbolic 
periodic orbit is an infinite sequence (or cascade) of period-doubling and 
saddle-node bifurcations. The set-up for the analysis will be very similar 
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to that given in Chapter 26 for the proof of Moser’s theorem. Specifically, 
we will analyze a sufficiently large iterate of the map that is defined in a 
neighborhood (in both phase and parameter space) of a homoclinic point. 
We begin by stating our assumptions. 

We consider a one-parameter family of two-dimensional C’’ (r > 3) dif- 
feomorphisms 

z G /r G / C (32.1.1) 



where I is some interval in We have the following assumption on the 
map. 

Assumption 1: Existence of a Hyperbolic Fixed Point. For all p, € I, 

/(0,/x)=0. (32.1.2) 



Moreover, z = 0 is a hyperbolic fixed point with the eigenvalues of Df{0, p) 
given by p{p), \{p), with 



0 < p{p) < 1 < \{p) < 



(32.1.3) 



We remark that since (32.1.3) is satisfied for all p G I, we will often omit 
denoting the explicit dependence of the eigenvalues p and X on p unless it 
is relevant to the specific argument being discussed. We denote the stable 
and unstable manifolds of the hyperbolic fixed point by fFp(O) and W“(0), 
respectively. 

Assumption 2: Existence of a Homoclinic Point. At p = Q, kFo(O) and 
Wq{Q) intersect. 

Assumption 3: Behavior Near the Fixed Point. There exists some neigh- 
borhood J\f of the origin such that the map takes the form 

y, p) = {px, Xy), (32.1.4) 



where x and y are local coordinates in Af . 

Note that Assumption 3 implies that W®(0) C\ Af and ^“(0) C\ Af are 
given by the local coordinate axes. 

Our final assumption will place more specific conditions on the geometry 
of the intersection of W®(0) with IF^(O) at /i = 0. This is most conveniently 
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done in terms of the local return map in a neighborhood of a homoclinic 
point which we now derive. 

We are interested in the dynamics near a homoclinic point. Therefore, 
using the same construction as that given in the proof of Moser’s theorem 
in Chapter 26, we will derive a map of a neighborhood of a homoclinic 
point into itself, which is given by /^, for some N (large). The construction 
proceeds as follows. Assumption 3 implies that there exists a point (0, ?/o) G 
Wq (0) nAf and a point (ccq, 0) G bbg (0) HAf such that /^(O, yo; 0) = (a;o, 0) 
for some fc > 1. Thus, following the construction in Chapter 26, we can find 
a neighborhood of (0,yo), Uy^ C N, and a neighborhood of (xo,0), Ux^ C 
N, such that f^{Uyg; 0) = Uxo', see Figure 32.1.1 (for all the details, see the 
construction in Chapter26) Note that by continuity, /^(-; fA) will be defined 
in Uyg for y sufficiently small. We can now state our final assumption. 




FIGURE 32.1.1. 



Assumption 4-' Quadratic Homoclinic Tangency at y = 0. We assume that, 
in Uyg, has the form 



: [/, 



yo 

{x,y) 



Ux 



(a;o - !3{y - yo), M + 7a: + 5{y - yof), 



(32.1.5) 



with /3,7, 5 > 0. Hence, for ji near zero, W^(O) and hF“(0) behave as in 
Figure 32.1.2. 

From Lemma 26.0.4 and the arguments given in Chapter 26 it follows 
that there exists an integer Nq such that, for all n > Nq, we can find subsets 
Uxg C UxQ such that /”(a;,y;y) = (p”a;,A”y) maps [7”^ into Uyg, i.e., 

nU^g-y) C Uyg 



(32.1.6) 
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FIGURE 32.1.2. 

for /X sufficiently small (note: this can be trivially verified due to the fact 
that we have assumed / is linear in Af ) . Therefore, 

r 0/'=^/"+'=: C/,0, 

{x, y) I — s> (p”(a;o - P{y - yo)),X'^{y + ix + S{y - yo)^)), (32.1.7) 

is well defined for y, sufficiently small (note: in general n = n{y)). 

We now give our first result. 

Theorem 32.1.1 (Gavrilov and Silnikov [1973]) At y = 0 there ex- 
ists an integer Nq such that, for all n > Nq, there exists a set h-n+k C M , 
invariant under such that is topologically conjugate to a 

full shift on two symbols. 

Proof: The proof can be found in Gavrilov and Silnikov [1973] or Gucken- 
heimer and Holmes [1983]. The basic idea is to choose a neighborhood of 
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FIGURE 32.1.3. 

(0,j/o) so that maps it back over itself and Assumptions 1 and 

3 of Chapter 25 hold; see Figure 32.1.3. In Exercise 1 we outline the steps 
necessary to prove this theorem. □ 



The next theorem tells us how the periodic orbits in the sets A„+fc, 
n > No, are created as ^ decreases through zero. 

Theorem 32.1.2 (Gavrilov and Silnikov [1973]) There exists an in- 
teger No and infinite sequences of parameter values 

n > No, 

{Tpd^}, n>No 



with > 0, qip+J' > 0 and 



oo 0, n oo 0 such that 



corresponds to a saddle-node bifurcation value for and 

corresponds to a period- doubling bifurcation value for f^+^. Moreover, 

Nn-\-k ^ Nn-\-k ^ iVn-j-l-l-fc ....^ .. iVn-l-771-l-fe ^ 7Vn~|“?Tl-l-fc ....^ 

Msw > l^PD > h-SN > Tpd >■■■> Tsn > Tpd > ■■■ 

(32.1.8) 



with 
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~ A " as n — >■ oo (32.1.9) 

and 

os n-^oo. (32.1.10) 

Before proving Theorem 32.1.2 we want to make several remarks. 

Remark 1. In the saddle-node bifurcation at /x = the node is actually 

a sink. The two orbits created are period n + k orbits for /. The sink cre- 
ated in the saddle-node bifurcation subsequently loses stability in a period- 
doubling bifurcation at ^ = /Xp"J)f (hence, we must have /Xpp^ < 
resulting in the creation of a period 2(n + k) sink for /. This bifurcation 
scenario will be verified in the course of the proof of Theorem 32.1.2. 

Remark 2. Theorem 32.1.2 tells us how the countable infinity of periodic 
orbits in the horseshoes for /x < 0 are created; namely, the periodic orbits 
are created in saddle-node bifurcations, and the period is increased through 
period-doubling bifurcations. 

Remark 3. Equations (32.1.9) and (32.1.10) gives us the rate by which the 
period-doubling and saddle-node bifurcation values accumulate on /x = 0. 
This rate is not universal but depends on the size of the unstable eigenvalue 
of the hyperbolic fixed point. 



Remark 4- Theorems 32.1.1 and 32.1.2 are stated explicitly for dissipative 
or nonconservative maps, i.e., Xp < 1. However, similar results will hold 
for area-preserving maps where we have Xp = 1; see Newhouse [1983] and 
Exercise 3. 

We now begin the proof of Theorem 32.1.2. 

Proof: The proof is constructive. The condition for fixed points of is 
given by 



x = p^xo- (}p^{y-yo), (32.1.11) 

y = ^A" + 7A”x + a"(x/-x/o)", (32.1.12) 

where it is important to recall that a;o, yo, 7 , /3, 5, and p > 0. By substi- 
tuting (32.1.11) into (32.1.12) we obtain 

<5A"j/2 _ (/3^A>" + 25A"yo + l)v + SX'-yl 
+ /37A>”2/o + 7P”A”xo + /xA" = 0. 



(32.1.13) 
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Solving (32.1.13) yields 

/37A”p” + 26X^yo + 1 

±^[((/37A>" + 25A"j/o + 1)" (32.1.14) 

- 46X^{SX^y^o + /37A”p”yo + 7 A>"xq + M”)] 

After some algebra, the expression under the radical in (32.1.15) can be 
simplified so that (32.1.15) becomes 

_ /37A"p" + 25A”yo + 1 ^ 1 

2(DA ^ 2(5A" 

X — + (j/oA~" - 7p”xo) - . (32.1.15) 

Note that (32.1.15) is a function of n and y,. Thus, (32.1.15) gives the 
y coordinate of a fixed point which can be substituted into (32.1.11) to 
obtain the x coordinate. Note that since (32.1.11) is linear in x, for a fixed 
y coordinate there is a unique x coordinate for the fixed point. Thus, in 
studying the numbers of fixed points and their bifurcations, it suffices to 
study only (32.1.15). 

From (32.1.15) we can easily see that there are no fixed points for 

d > + (yoA-" - ^p-xo) (32.1.16) 

and two fixed points for 

4+ A J 

Therefore, 

d = + (yoA-" - ^p-xo) (32.1.18) 

is a bifurcation value for 

Next, we verify that this is a saddle-node bifurcation. This can be shown 
directly. From (32.1.7), the matrix associated with the linearized map is 
given by 
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hence we have 

det or+'^ = 7/3p”A", (32.1.20) 

tri7/"+fc = 2(5A”(y-2/o), (32.1.21) 

with eigenvalues, xi and X 2 , of (4.7.17) given by 

fj. nf"+fe 1 / 

Xi.2= ^ ±-^(trZ9/"+fc)2-4deti4/"+^ (32.1.22) 

At the bifurcation value (32.1.18) there is only one fixed point where, using 

(32.1.20), (32.1.21), and (32.1.22), the eigenvalues of (32.1.19) are given by 

XI = 1, X2=7/3p”A". (32.1.23) 

Note that since pX < 1, by taking n sufficiently large, X 2 can be made 
arbitrarily small. 

At this point we want to check the stability of the bifurcating fixed points. 
From (32.1.20), (32.1.21) and using pX < 1, we see that for n sufficiently 
large, the eigenvalues of (32.1.19) are approximately given by 

Xl«trZ9/"+^ (32.1.24) 

X2« 0, (32.1.25) 



and by substituting (32.1.15) into (32.1.21) (and neglecting terms of 
0(p”A")) we have 




Thus, for the branch of fixed points with y coordinate given by 
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the eigenvalues associated with the linearized map are, for n sufficiently 
large, approximately given by 

XI « 1 + ^ + (ypA-" - 7X0P") - , 

X2 « 0. (32.1.28) 

Hence, for 

/t < ^ + (l/oA-" - 7^0P”) (32.1.29) 

(which, from (32.1.18), is the saddle-node bifurcation value), it is easy to 
see that this fixed point is always a saddle. Similarly, for the branch of fixed 
points given by 

/37A”p” -k 2(5A”j/o + 1 




(32.1.30) 

the eigenvalues associated with the linearized map are, for n sufficiently 
large, approximately given by 




Therefore, for fj, “slightly” less than — — + (yoA“" — 7 p"xo), it is 

easy to see that this fixed point is a sink. However, as fj, decreases further, it 
is possible for xi to decrease through —1 and, consequently, for this branch 
of fixed points to undergo a period-doubling bifurcation. We now want to 
study this possibility. 

We are considering the branch of fixed points with y coordinate given by 

/37A”p” -k 2(5A”j/o + 1 1 

^ “ 2(5A" 2(5A" 

X +(j/oA-"-7p"xo) -a^^( 32 . 1 . 32 ) 
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From (32.1.22), the condition for an eigenvalue of to be —1 is 

1 + det £'/”+'= = -tr £)/”+'=. (32.1.33) 

Substituting (32.1.20) and (32.1.21) into (32.1.33) yields 

1 + 7/3/9” A" = 2(5A”yo - 25A”y, (32.1.34) 



and substituting (32.1.32) into (32.1.34) yields 

1 + 7/3/9” A” = ^ ^2" (^JhPl±2^ + (y^A-" - 7P”xo) - . 

(32.1.35) 

By solving (32.1.35) for we obtain 

M = -i(7/3/9” + A-”)2 + (2 /oA-” - 7P”xo). (32.1.36) 



Hence, (32.1.36) is the bifurcation value for the period-doubling bifurcation 
of the sink created in the saddle-node bifurcation. We leave it as an exercise 
for the reader to verify that the period-doubling bifurcation is “generic” 
(see Exercise 2). 



Let us summarize what we have shown thus far. The map for n 

sufficiently large, undergoes a saddle-node bifurcation at 



n+k _ {Pip"' 

PSN — 



+ A-”)2 



46 



+ (//oA ”-7/9”xo). 



(32.1.37) 



In this bifurcation two fixed points of are created, a saddle and a 

sink. As /x decreases below the saddle remains a saddle but the sink 

undergoes a period-doubling bifurcation at 






3(/37p” -k A-”)2 
45 



+ (//oA ”-7p”a;o). 



(32.1.38) 



It is easy to see from (32.1.37) and (32.1.38) that we have 



n+k , ,,n+k 
PPD ^ PSN ■ 



Also, for n sufficiently large, we have 

yoA-” - 7/9”xo » (/37/9” + A-”)2. 



(32.1.39) 
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Using (32.1.39) along with the fact that 



(j/oA-" - jp'^xo) = A-"(yo - 7 P”A”xo) 



(32.1.40) 



with pA < 1 implies that 
and 



> 0 



> 0 



for n sufficiently large. Next we need to show that 

,.n+l+k , ,,"+fc 

PSN ^ PPD ■ 

From (32.1.37) and (32.1.38), we have 

”+1+'= - (/^AP"+^-A " ^ \-"-\yo - 7p”+^A-”-^xo) (32.1.41) 



f^SN — 
and 



A5 



n+k 3{(3jp^ - , 

t^PD = — ^ + A ”(yo - 7P"A"a:o). 



(32.1.42) 



Using (32.1.39) and (32.1.40), along with the fact that A > 1, we can easily 
see from (32.1.41) and (32.1.42) that, for n sufficiently large, we have 



n+l+fc ^ 

^ ^PD ' 



(32.1.43) 



Equation (32.1.8) now follows from (32.1.43) by induction. Finally, it fol- 
lows from (32.1.39), (32.1.37) and (32.1.38) that 

~ A“" as n — >■ oo 



and 

~ A“" as n — >■ oo. □ 

Let us now comment on the generality of our results. In particular, in 
Assumption 2 we assumed that our map is linear in a neighborhood, Af, 
of the origin, and in Assumption 3 we assumed that the form of the map 
defined outside of a neighborhood of the origin is given as in (32.1.5). It 
follows from the work of Gavrilov and Silnikov [1972], [1973] that our results 
are not restricted by these assumptions in the sense that if we assume the 
most general forms for f in Af and for /*, Theorems 32.1.1 and 32.1.2 are 
unchanged. We remark that this generally holds for the study of the orbit 
structure near orbits homoclinic to hyperbolic periodic points. A return 
map defined near a homoclinic point constructed as the composition of an 
iterate of the linearized map near the origin with low-order terms in the 
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Taylor expansion of an iterate of the map outside of a neighborhood of 
the origin is sufficient to capture the qualitative dynamics in a sufficiently 
small neighborhood of the homoclinic point. Let us now briefly describe 
the set-up considered by Gavrilov and Silnikov. 



In local coordinates in M, Gavrilov and Silnikov showed that a general 
C’’ (r > 3) diffeomorphism can be written in the form 



/ X{fi)x + f{x,y; y.)x\ 
\p{y)y + g{x,y;n)y ) ' 



(32.1.44) 



For the form of acting outside of J\f (but mapping a neighborhood of a 
homoclinic point on the local unstable manifold in Af to a neighborhood of 
the local stable manifold in M) , they assumed the completely general form 



f-- 



xo + F{x,y-yo;p) 
G{x,y - yo;y.) 



(32.1.45) 

The assumption of quadratic homoclinic tangency at /x = 0 requires 



Gj^(0,0,0) = 0, (32.1.46) 

Gj,j,(0,0,0) ^ 0 (32.1.47) 

(note: since / is a diffeomorphism, then (32.1.46) implies that we must have 
Gx(0,0,0) yf 0 and Fy(0, 0,0) yf 0). Gavrilov and Silnikov then simplify 
(32.1.45) as follows; letting 



y -yo = (32.1.48) 

be the (unique) solution of 

Gy{x,y - yo,y,) = 0 (32.1.49) 

(which can be solved by the implicit function theorem as a result of 
(32.1.47)), (32.1.45) can be rewritten as 



I xo + F{x,y -yo]y) \ 

\y J \E{y-) + G{x, y)x + D{x, y){y - yo - (j^ix, y))^ J ’ 

where 



(32.1.50) 



E{y) = G{0,cj){0,y),y), E{0) = 0, 

G(0,0) = c, 

2L>(0, 2/0,0) = d. 



The reader should note the similarity of (32.1.50) and (32.1.5). The num- 
bers c and d in (32.1.50) describe the geometry of the tangency of the stable 




774 



32. Long Period Sinks in Dissipative Systems and Elliptic Islands 



and unstable manifolds. Gavrilov and Silnikov show that there are ten cases 
to consider depending on the signs of A, p, c, and d. (Note: yes, there are 
16 possible combinations of signs of these parameters, but Gavrilov and 
Silnikov show how to reduce the number of possibilities.) 

Five of the cases correspond to orientation-preserving maps with the re- 
maining five corresponding to orientation-reversing maps. The example we 
treated corresponds to one of the five orientation-preserving maps with A, 
p, c,d> 0. The structure of the period-doubling and saddle-node cascades 
can be different for the remaining cases, and we refer the reader to Gavrilov 
and Silnikov [1972], [1973] for the details. 



32.2 Newhouse Sinks in Dissipative Systems 



The following corollary is an obvious consequence of the proof of Theorem 
32.1.2. 

Corollary 32.2.1 Let Pq denote a point of quadratic tangency of VFq (0) 
and Wq{Q). Then for all n sufficiently large there is a parameter value, 
^n+k ^ Ijn+k q as u ^ oo , such that f has a periodic sink, Pn, of 
period n + k with Pn ^ Po as n — >■ oo . 



The reason for pulling out this corollary from Theorem 32.1.2 is that if we 
put it together with some deep results of Newhouse concerning the persis- 
tence of quadratic homoclinic tangencies, we get a very provocative result. 
(Note: Newhouse’s results do not require our more restrictive Assumptions 
3 and 4 given above but, rather, they apply to any two-dimensional dif- 
feomorphism having a dissipative hyperbolic periodic point whose stable 
and unstable manifolds have a quadratic tangency.) We now want to give 
a brief description of the implications of Newhouse’s results in the context 
of this section. 

It should be clear that a specific point of tangency of IFq (0) and Wq{0) 
can easily be destroyed by the slightest perturbation. However, Newhouse 
[1974] has proven the following result. 



Theorem 32.2.2 For £ > 0, let Ig = {p £ I \ \p\ < s}. Then, for every 
e > 0, there exists a nontrivial interval Ig C Ig such that Ig contains a 
dense set of points at which IT^(O) and 1F“(0) have a quadratic homoclinic 
tangency. 
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Proof: This “parametrized” version of Newhouse’s theorem is due to Robin- 
son [1983]. □ 



Heuristically, Theorem 32.2.2 says that if we destroy the quadratic ho- 
moclinic tangency at ^ = 0 by varying /x slightly, then for a dense set 
of parameter values containing /i = 0, we have a quadratic homoclinic 
tangency somewhere else in the homoclinic tangle. Thus, Corollary 32.2.1 
can be applied to each of these tangencies so that Corollary 32.2.1 and 
Theorem 32.2.2 together imply that there are parameter values at which 
the map has infinitely many periodic attractors which coexist with Smale 
horseshoe-type dynamics. This phenomenon is at the heart of the diffi- 
culties encountered in proving that a two-dimensional map possesses a 
“strange attractor.” We will discuss this in much more detail in Chapter 
30. 

We close with some final remarks. 



Remark 1: For higher dimensional generalizations of the Newhouse sink 
phenomenon see Palis and Viana [1994]. 

Remark 2: The Codimension of a Homoclinic Bifurcation. By the term 
“homoclinic bifurcation,” we mean the creation of transverse homoclinic 
orbits to a hyperbolic periodic point of a two-dimensional diffeomorphism 
as parameters are varied. Since the stable and unstable manifolds of the hy- 
perbolic periodic point are codimension one, their transversal intersection 
occurs stably in a one-parameter family of maps. Now recall the definition 
of “codimension of a bifurcation” given in Chapter 20. Roughly speaking, 
the codimension is the number of parameters in which the corresponding 
parametrized family is stable under perturbations. We have seen in this 
section that there is an infinity of saddle-node and period-doubling bifurca- 
tion values accumulating on the quadratic homoclinic tangency parameter 
value. Thus the type of bifurcation is codimension infinity by the standard 
definitions. In particular, one cannot find a versal deformation for this type 
of bifurcation satisfying the standard definitions given in Chapter 20. 



Remark 3. There have been a number of papers by the Maryland group 
over the last few years that have shed much light on the creation of horse- 
shoes. In particular, we refer the reader to Yorke and Alligood [1985] and 
Alligood et al. [1987]. In Tedeschini-Lalli and Yorke [1986] the question of 
the measure of the set of parameter values for which the map possesses an 
infinite number of coexisting periodic sinks is addressed. For a “generic” 
map, the measure of the set is shown to be zero. 
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Remark 4- For additional recent work on the dynamical consequences as- 
sociated with homoclinic tangencies see Diaz et al. [1999], Diaz and Rocha 
[1997a, b], Diaz [1995] and Diaz and Ures [1994]. 



32.3 Islands of Stability in Conservative Systems 

There is a conservative analog of the Newhouse sink phenomenon. Early 
results along these lines are due to Newhouse [1977]. An analysis similar in 
spirit to the Gavrilov and Silnikov work described in this chapter is given 
in Gonchenko and Silnikov [2000]. See also Duarte [1999]. 



32.4 Exercises 



1. Prove Theorem 32.1.1. {Hints: the idea is to find a region Sn in Uy^ such that As- 
sumptions 1 and 3 of Chapter 25 hold for i) Let Sn — \ \y — yo\ S: 

0 < ai < i/^} where p <. v <. The idea is to show that (Sn ; 0) intersects 

Sn in two /a^;-vertical strips. Then the pre-image of these two /^.^-vertical strips will 
be /L;^-horizontal strips with proper boundary behavior and 0 < < 1- This can 

be accomplished in several steps. First show that under /^(-;0), vertical lines in Sn 
(i.e., lines with x — c — constant) map to parabolas in Uxq given by the graph of 
y — 7 c-|- {x — xq)^ . 2) Show that for n sufficiently large, the x-components of points 
in are smaller than . 3) Finally, show that for s — s{n) ~ {yoX~^ / , 

f^^^{Sn \ 0) cuts through the top and bottom horizontal boundaries of Sn - From these 
three facts you should be able to find /ih,-horizontal and /j.t;-vertical strips so that As- 
sumption 1 is satisfied. 

The proof that Assumption 3 holds is very similar to the same step carried out in 
Theorem 26.0.5. Use the fact that 



Dr+'‘(x,y,0) 



( 0 -/3p" \ 

2(5A’^(y - yo) / 



and that, for \y — yol ~ (2/0-^ and n sufficiently large, this Jacobian is essen- 

tially 

D/"+y.,y;0)~ (^°„ 

(Drawing figures in each case should help.) 

2. Recall the proof of Theorem 32.1.2. In showing that the sink created in the saddle-node 

bifurcation of subsequently underwent a period-doubling bifurcation at 

3 n -n 2 -n n 

p = - — (7/3p -I-''* ) -I- (yoA -IP xo), 

we only showed that the map had an eigenvalue of —1 at this parameter value. Examine 
the nonlinear terms (possibly do a center manifold reduction) to show that this period- 
doubling bifurcation is indeed nondegenerate or “generic.” 

3. Show that Theorem 32.1.2 holds for area-preserving maps. {Hint: use the implicit 
function theorem proof given in Tedeschini-Lalli and Yorke [1986].) 

4. Does the result of Theorem 32.1.1 hold for area-preserving maps? 
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Global Bifurcations Arising 
from Local Codimension — Two 
Bifurcations 



In Section 20.6 we studied the bifurcation of a fixed point of a vector 
field in the situation where the matrix associated with the linearization of 
the vector field at the bifurcation point had two zero eigenvalues, and in 
Section 20.7 we studied the situation where the matrix had a zero and a 
pure imaginary pair of eigenvalues (with any remaining eigenvalues having 
nonzero real part). In both cases we saw that dynamical phenomena arose 
which could not be explained by any local bifurcation analysis, and in this 
section we want to attempt to complete the analysis. In the case of the 
double-zero eigenvalue we will succeed completely. In the case of the zero- 
pure imaginary pair we will only achieve partial success. We begin with the 
double-zero eigenvalue. 



33.1 The Double-Zero Eigenvalue 



Recall from Section 20.6 that the truncated normal form associated with 
this bifurcation is given by 

x = y, 

y = fii + Ii 2 y + + bxy , 6 = ±1. (33.1.1) 



Equation (33.1.1) applies to generic vector fields, i.e., there are no symme- 
tries and we treat the case 5 = -1-1. 

Recall that (33.1.1) has no periodic orbits for > 0 and, for < 
0, periodic orbits are created in a Poincare-Andronov-Hopf bifurcation. 
Therefore, there must be some other bifurcation occurring which accounts 
for the destruction of the periodic orbits as yi increases through zero. In 
Section 20.6 we gave some heuristic arguments as to why this should be 
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a homoclinic or saddle-connection bifurcation, and now we want to prove 
this. 

We begin by rescaling the dependent variables and parameters of (33.1.1) 
as follows 



x = £^u, y = e^v, yi = —e'^, M 2 = (e > 0), (33.1.2) 

and we rescale the independent variable time as follows 




so that (33.1.1) becomes 



u = V, 

V = —1 e{v 2 V uv) . (33.1.3) 



Notice that, in the original variables, we are interested in /ii < 0 and that 
our rescaling allows us to interpret our results in this parameter regime. 
(Note: the reader should be somewhat irritated that we have simply pulled 
this particular rescaling “out of the air.” For now, we will proceed with the 
analysis, but at the end of this section we will discuss “why it works.”) 

The single most important characteristic of this rescaling is that, for 
£ = 0, the rescaled equations (33.1.3) become a completely integrable 
Hamiltonian system with Hamiltonian function given by 

H{u,v) = '^+u-'^. (33.1.4) 

Melnikov’s method can then be used to perform a global analysis that 
includes the effects of the higher order terms of the normal form. The 
phase space of this completely integrable Hamiltonian system is shown in 
Figure 33.1.1. Thus, the vector field 



u = V, 

v = -l + u2^ (33.1.5) 



has a hyperbolic fixed point at 



{u,v) = ( 1 , 0 ) 
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FIGURE 33.1.1. 

an elliptic fixed point at 

(u,v) = (-1,0), 

and a one-parameter family of periodic orbits surrounding the elliptic fixed 
point. We denote the latter by 

(u“(t), w“(t)), aG[— 1,0) (33.1.6) 

with period T“ where 

(M-i(t),u-i(t)) = (-l,0) (33.1.7) 



and 



lim(M“(t),i;“(t)) = {uo{t),Vo{t)) 

Oc—^0 



= ( 1 — 3sech^ , 3-\/2sech^ tanh 

V2 ^2^2 



(33.1.8) 



is a homoclinic orbit connecting the hyperbolic fixed point to itself. 

The Melnikov theory can now be used to determine the effect of the 
0{e) part of (33.1.3) on this integrable structure. The homoclinic Melnikov 
function is given by 

/ OO 

vo{t) [iy2Vo{t) + Mo(i)^^o(i)] dt. 

-OO 



(33.1.9) 
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Using the expression for uo(f) and vo{t) given in (33.1.8), (33.1.9) becomes 

M(j/2) =7i^2- 5 



M{iy2) = 0=>iy2 = Y, (33.1.10) 

hence, a bifurcation curve on which the stable and unstable manifolds of 
the hyperbolic fixed point coincide is given by 

jy^=^ + 0{e). (33.1.11) 

We now want to translate (33.1.11) back into our original parameter values. 
Using (33.1.2) and (33.1.11), we obtain 

= - (^) (33.1.12) 



for the homoclinic bifurcation curve. Note that we have 

49 

M > 0 for /xi > 



M < 0 for /xi < 



(33.1.13) 

(33.1.14) 



which give us the relative orientations of the stable and unstable manifolds 
of the hyperbolic fixed points that we show in Figure 33.1.2. 

Thus, we have shown the existence of a global mechanism for periodic 
orbits to be created and destroyed. Can we now actually claim that the pe- 
riodic orbit created in the Poincare- Andronov-Hopf bifurcation is the one 
that is destroyed in the homoclinic bifurcation? No, we cannot, because it 
is possible for nonlocal saddle-node bifurcations of periodic orbits to occur 
in the parameter region between the Poincare-Andronov-Hopf and homo- 
clinic bifurcation curves. We can check whether or not such bifurcations 
occur by computing the Melnikov function for the periodic orbits. This is 
given by 

M {a] V2) = i>2 [ dt + ( u°‘{t){v°'{t)Ydt. (33.1.15) 

Jo Jo 



The condition M{a]V 2 ) = 0 implies the existence of a periodic orbit for 
(33.1.3). Using (33.1.15), this is equivalent to 
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FIGURE 33.1.2. 






ir 



(33.1.16) 



Now, if /(a) is a monotone function on [—1,0], we can conclude that 
(33.1.3) has a unique periodic orbit created in a Poincare-Andronov-Hopf 
bifurcation and destroyed in a homoclinic bifurcation. It turns out that fipt) 
is indeed monotone. This can be verified by computing the expressions for 
(u“(t), u“(t)) directly in terms of elliptic functions and then analytically 
evaluating /(a) from (33.1.16). Once this is done, monotonicity properties 
of /(a) may be studied. This is a fairly messy (but, in principle, straight- 
forward) calculation that we leave as an exercise for the interested reader. 
We note that this result was obtained by Bogdanov [1975], Takens [1974], 
and Carr [1981]. Therefore, the complete bifurcation diagram for (33.1.1) 
with 6 = -|-1 is as shown in Figure 33.1.3 and we end with the following 
remarks. 



Remark 1. The local bifurcation analysis of (33.1.1) did not require any 
smallness restrictions on fii and ^ 2 - The global bifurcation analysis does 
require /ii and to be “sufficiently small.” 



Remark 2. It is now fairly easy to show that restoring the higher order terms 
in the normal form (33.1.1) does not qualitatively affect the bifurcation 
diagram. We will outline the necessary arguments in Exercise 2. 
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33.2 A Zero and a Pure Imaginary Pair of 
Eigenvalues 



The normal form associated with this bifurcation is three-dimensional. 
However, the symmetry in the linear part associated with the pure imag- 
inary eigenvalues enabled us to decouple one of the coordinates from the 
remaining two so that we could begin our analysis using phase plane tech- 
niques. In some sense, the dynamics in this phase plane can be viewed as 
an approximation to a Poincare map of the full three-dimensional normal 
form. This is also a result of the symmetry of the linear part. Our analysis 
will proceed in the following steps. 

Step 1. Analyze global bifurcations of the associate truncated, two-dimen- 
sional normal form. 

Step 2. Interpret in terms of the truncated, three-dimensional normal form. 
Step 3. Discuss the effects of higher order terms in the normal form. 

Step 1. Recall from Section 20.7 that the associated two-dimensional normal 
form of interest is given by 

r = /iir -I- arz, 

z = fj ,2 + br^ — z“^ , (33.2.1) 



where a yf 0 and 6 = ±1. There were essentially only four distinct cases to 
study, and only two admitted the possibility of global bifurcations (for r, 
0 small). They were denoted by 
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Case IIa,b a < 0, 6 = +1, 

Case III a > 0, 6 = — 1. 



In Case IIa,b we are interested in the dynamics near the /i 2 -axis for fj ,2 < 0. 
In Case III we are interested in the dynamics near the /^ 2 -axis for /X 2 > 0. 
In both cases the normal form was integrable on the /i 2 -axis (with the 
appropriate sign of ^ 2 )', thus, we would expect higher order terms in the 
normal form to drastically affect the dynamics in this parameter regime. 
Our strategy will be to include the cubic terms in the normal form and to 
introduce a scaling of the variables so that we obtain a perturbed Hamilto- 
nian system. Then a Melnikov-type analysis can be used to determine the 
number of periodic orbits and possible homoclinic bifurcations. 

From (20.4.44), restoring the cubic terms to (33.2.1) gives 

f = /XiT -I- arz + {cr^ + dr'^z), 

2 + br“^ — z"^ + {er'^z + fz^). (33.2.2) 



In Guckenheimer and Holmes [1983] it is shown that coordinate changes 
can be introduced so that all cubic terms except for z^ in (33.2.2) can be 
eliminated (see Exercise 3 where we outline this procedure). Hence, without 
loss of generality, we can analyze the following normal form 

f = fiir + arz, 

z = fj ,2 + br“^ — z^ + fz^. (33.2.3) 

We next rescale the dependent variables and parameters as follows 

r = su, z = ev, /ii = fi 2 = s'^r' 2 , (33.2.4) 

and we rescale time as follows 



t 



et, 



so that (33.2.2) becomes 



M = auv + eviu, 

V = U2 + bu^ — v'^ + efv^. 



(33.2.5) 



At e = 0, the vector field has the first integral (for a yf 1) 

b 



F{u,v) = -u 



V2 



\ Q, 



2 2 
-U — V 



(33.2.6) 
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Unfortunately, it is not Hamiltonian, but can be made Hamiltonian (Guck- 
enheimer and Holmes [1983]) by multiplying the right-hand side of (33.2.5) 
by the integrating factor u( 2 /a)-i obtain 

ii = av^l^^v -I- 

V = + 5 „( 2 /a) + l _ y(2/a)-1^2 ^ (33 2J) 

where we let = Tl when 6 = ±1 since, for Case Ha,b, we are interested 
in ^2 < 0 and, for Case HI, we are interested in ^2 > 0- For e = 0, (33.2.7) 
is Hamiltonian with Hamiltonian function 

H{U,V) = + ^^2/a _ _^^^{2/a)+2 a+1^0 

^ ^ 2 2 2(a+ 1 ) ^ 

or 

H{u,v) = — blogu, a-|-l = 0. (33.2.8) 

In Figure 33.2.1 we show the level sets of the Hamiltonian (i.e., the orbits 
of (33.2.7) for e = 0) for the relevant cases. We see from this figure that, 
for Case Ha,b, the integrable Hamiltonian system has a one-parameter 
family of periodic orbits surrounding an elliptic fixed point with the orbits 
becoming unbounded in amplitude. In Case HI the integrable Hamiltonian 
system has a one-parameter family of periodic orbits surrounding an elliptic 
fixed point that limit on a heteroclinic cycle. In both cases we denote the 
one-parameter family of orbits by 

(u“(t),u“(t)), aG[-l,0), 

with period T“, where (t6“^(f), u“^(t)) is an elliptic fixed point in both Case 
Ha,b and Case HI, and lima_s.o(M“(i), ^“(t)) is an unbounded periodic orbit 
in Case Ha,b and a heteroclinic cycle in Case HI. 

The Melnikov functions are given by 

M{a; vi) 

= af f (u“(t))(4/“)-l(z;“(t))4rft 

Jo 

[6(u“(t))(4/“)+^ - 6(u“(t))(4/“)-i 
+ (u“(t))(4/“)-i(u“(t))2]dt. (33.2.9) 

Therefore, M{a;iyi) = 0 is equivalent to 
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= /(a). (33.2.10) 

We are interested in 

Case IIa,b a < 0, 5 = 1, / 0, <0, 

Case III a > 0,b = —1, f ^ 0, > 0. 

Thus, if f(a) is a monotone function of a (for a, b, and / fixed as above), 
then (33.2.7) has a unique periodic orbit which is born in a Poincare- 
Andronov-Hopf bifurcation (we consider stability in Exercise 4) and grows 
monotonically in amplitude in Case IIa,b and disappears in a heteroclinic 
bifurcation in Case III. However, proof that (33.2.10) is monotonic in a 
is a formidable and difficult problem, since the integrals cannot be evalu- 
ated explicitly in terms of elementary integrals. Fortunately, it has recently 





FIGURE 33.2.1. Integrable structure of (4.9.22) for e = 0. a) Case III; b) Case 
IIa,b, —1 < a < 0; c) Case IIa,b, a < —1. 
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been proven by Zoladek [1984], [1987] (see also Carr et al. [1985], van Gils 
[1985] and Chow et al. [1989] ) that (33.2.10) is indeed monotone in a. 
The techniques in these papers for proving monotonicity involve compli- 
cated estimates that we will not go into here. In Figure 33.2.2 we show an 
example of a possible bifurcation for Case III with a = 2, / < 0. 

Step 2. We now turn to Step 2, interpreting the dynamics of the two- 
dimensional vector field in terms of the three-dimensional dynamics. The 
truncated, three-dimensional normal form is given by 

r = /iir -I- arz, 
z = p ,2 + br'^ - + fz^, 

e = uj + --- . (33.2.11) 



Thus, the r and z components of (33.2.11) are independent of 9, and the 
discussion in Section 20.7 still holds. Namely, it is the case that the peri- 
odic orbits become invariant two tori in the full three-dimensional phase 
space and, in Case III, the heteroclinic cycle becomes such that the two- 
dimensional stable manifold and one-dimensional unstable manifold of the 
hyperbolic fixed point with z < 0 coincides with the two-dimensional unsta- 
ble manifold and one-dimensional stable manifold of the hyperbolic fixed 
point with z > 0 creating the invariant sphere (with invariant axis), as 
shown in Figure 33.2.3. Both situations are radically altered by the addi- 
tion of the higher order terms in the normal form and we now turn to a 




FIGURE 33.2.2. 
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discussion of this situation in Step 3. 



Step 3. The problem of how the invariant two-tori of the truncated normal 
form in Cases IIa,b and III and the heteroclinic cycle in Case III are affected 
by the higher order terms of the normal form is difficult, and the full story 
is not yet known. We will briefly summarize the main issues and known 
results. 



Invariant Two-Tori 

Concerning the invariant two-tori of the truncated normal form in Cases 
IIa,b and Case III there are two main questions that arise. 



1. Do the two-tori persist when the effects of the higher order terms of 
the normal form are considered? 

2. In the truncated normal form the orbits on the invariant two-tori 
are either periodic or quasiperiodic, densely covering the torus. Do 
invariant two-tori having quasiperiodic flow persist when the effects 
of the higher order terms of the normal form are considered? 



In answering the first question techniques from the persistence theory of 
normally hyperbolic invariant manifolds are used (see, e.g., Fenichel [1971], 
[1979], [1977], Hirsch, Pugh, and Shub [1977], and Wiggins [1994]). The 
application of these techniques is not straightforward, since the strength 
of the normal hyperbolicity depends on the bifurcation parameter. Results 




FIGURE 33.2.3. a) Cross-section of the heteroclinic cycle for the truncated nor- 
mal form, b) Heteroclinic cycle for the truncated normal form. 
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showing that invariant two-tori persist have been obtained by looss and 
Langford [1980] and Scheurle and Marsden [1984]. 

In answering the second question small divisor problems arise which 
necessitate the use of KAM-type techniques (see, e.g., Siegel and Moser 
[1971]). This is very much beyond the scope of this book. However, we 
mention that some results along these lines have been obtained by Scheurle 
and Marsden [1984]. Their results imply that on a Cantor set of parameter 
values having positive Lebesgue measure one has invariant two-tori having 
quasiperiodic flow. Many of these issues are discussed in the section on 
circle maps in Section 21.6. 



Heteroclinic Cycle 

In Case III the truncated normal form has two saddle-type fixed points 
on the z axis; pi having a two-dimensional stable manifold and a one- 
dimensional unstable manifold and p 2 having a two-dimensional unstable 
manifold and a one-dimensional stable manifold with coinciding 

with 1 T“(p 2) to form the invariant sphere as shown in Figure 33.2.3. The 
axis of the sphere is formed from the coincidence of a branch of bF“(pi) 
with a branch of lF®(p 2 )- 

We expect the effects of the higher order terms of the normal form 
to drastically alter this situation. Generically, we would expect the one- 
dimensional unstable manifold of pi and the one-dimensional stable man- 
ifold of p 2 to not intersect in the three-dimensional phase space. Simi- 
larly, generically we would expect the two-dimensional stable manifold of 
P 2 and the two-dimensional unstable manifold of pi to intersect along one- 
dimensional orbits in three-dimensional phase space. We illustrate these 
two solutions in Figure 33.2.4. 

Now it may happen that when this degenerate structure is broken the 
branch of kF“(pi) inside the sphere falls into kF®(pi), or, similarly, the 
branch of W^{p 2 ) inside the sphere falls into lF“(p 2 ); see Figure 33.2.5. 

If this happens, then from Section 27.3 the reader should realize that it 
is possible for a return map defined near one or the other of these homo- 
clinic orbits to possess a countable infinity of Smale horseshoes, i.e., Sil- 
nikov’s phenomenon may occur. Moreover, from Section 27.3 a countable 
infinity of period-doubling and saddle-node bifurcation values would accu- 
mulate on the parameter value at which the homoclinic orbit was formed. 
Thus, this particular local codimension-two bifurcation would actually be 
codimension-infinity when all of the dynamical effects are included (see the 
comments at the end of Chapter 32). 
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The fact that this local codimension-two bifurcation point can exhibit 
Silnikov’s phenomenon in its versal deformation has been proved by Broer 
and Vegter [1984]. They prove that the full normal form possesses orbits 
homoclinic to a hyperbolic fixed point. Their result is very delicate in the 
sense that, due to the rotational symmetry of the linear part of the normal 
form, the symmetry is preserved to all orders in the normal form (i.e., the r 
and z components of the normal form are independent of 9) . Thus, the ho- 
moclinic orbits are a result of exponentially small terms that are not picked 
up in the Taylor expansion and subsequent normal form transformation of 
the vector field. 




FIGURE 33.2.4. a) Cross-section of the manifolds for the full normal form, b) 
Homoclinic orbit for the full normal form. 




FIGURE 33.2.5. Possible homoclinic orbits for the full normal form. 
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The Hamiltonian-Dissipative Decomposition 

The key aspect that enabled the preceeding analyses to go through was 
the fact that one could find an “inspired” rescaling that transformed the 
normal form into a perturbation of an integrable Hamiltonian system with 
the higher order terms in the normal form part of the perturbation (note: 
the perturbation need not also be Hamiltonian). Once this has been ac- 
complished, a wealth of techniques for the global analysis of nonlinear dy- 
namical systems can then be employed; for example, Melnikov theory, per- 
turbation theory for normally hyperbolic invariant manifolds, and KAM 
theory. However, the real question is, “in a given problem, how can we find 
the rescalings which turn the problem into a perturbation of a completely 
integrable Hamiltonian system?” 

The rescalings in this section are due to the cleverness of Takens [1974], 
Bogdanov [1975], Guckenheimer and Holmes [1983], Kopell and Howard 
[1975], and looss and Langford [1980]. However, in recent years there has 
been an effort to understand the structure of the normal form that leads to 
a splitting of the normal form into a Hamiltonian part and a dissipative part 
so that the appropriate rescalings can be generated by some computational 
procedure. We refer the reader to Lewis and Marsden [1989] and Olver and 
Shakiban [1988]. 



33.3 Exercises 



1. Show that the function /(cn) defined in (33.1.16) is monotone. 

2. Show that the two-parameter family 

X ^ y, 

y = fii + ii 2 V + 3? + bxy, 6 = ±1, (33.3.1) 

is a versal deformation of a fixed point of a planar vector field at which the matrix 
associated with the linearization has the form 

(s ;)■ 

{Hint: the idea is to show that the neglected higher order terms in the normal form 
do not introduce any qualitatively new dynamics in the sense that the bifurcation 
diagram in Figure 33.1.3 is unchanged. Begin by considering the fixed points and local 
bifurcations and show that these are qualitatively unchanged. Next, consider the global 
behavior, i.e., homoclinic orbits and “large amplitude” periodic orbits. The Melnikov 
theory can be used here. 

Once all of these results are established, does it then follow that (33.3.1) is a versal 
deformation? ) 

3. Recall Exercise 1 following Section 20.6, the double-zero eigenvalue with the symmetry 
(rc, y) {—X, —y)- The normal form was given by 
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X ^ y, 

y — fiix + fi 2 V + cx^ — x^y, c — ±1. (33.3.2) 

In this exercise we want to analyze possible global behavior that might occur. 

a) For c = +1, using the rescaling 

X — eu, y — s^v, //i — — fi 2 — s^U 2 
and t —¥ show that (33.3.2) becomes 
ii — V, 

V = —u + + e{u 2 V — u^v). (33.3.3) 

b) Show that (33.3.3) is Hamiltonian for £ = 0 and draw the phase portrait. 

c) Use the Melnikov theory to show that (33.3.3) has a heteroclinic connection on 

112 = + •• • . (33.3.4) 

5 

What is the form of the higher order terms in (33.3.4) (i.e., 0(//^), where a is 
some number)? This is important for determining the behavior of the bifurcation 
curves at the origin. 

d) Show that (33.3.3) has a unique periodic orbit for < 0 between y .2 — 0 and 
A*2 = -^+-". 

e) Draw the complete bifurcation diagram for (33.3.2) with c = +1. Is (33.3.2) a 
versal deformation for c = +1? 

f) For c = —1, using the rescaling 

X — £u, y — £^v, yii — (12 — e^U2, 

and t ^ , show that (33.3.2) becomes 

u — V, 

V — u — -\- £{u 2 V — u^v). (33.3.5) 

g) Show that (33.3.5) is Hamiltonian for £ = 0 and draw the phase portrait. 

h) Using the Melnikov theory, show that (33.3.5) undergoes a homoclinic bifurca- 
tion on 

4 

M2 = rMi H > (33.3.6) 

5 

and a saddle-node bifurcation of periodic orbits on 

(i2 — c(i\ , (33.3.7) 

with c ~ .752. Hence, for (i\ > 0, between (12 — (ii and fi 2 — f Mi + ‘ ‘ ‘ > 
(33.3.2) has three periodic orbits for c = —1. What are their stabilities? Between 
fi 2 — ^{ii + • • • and (12 — c(ii -!-••• (33.3.2) has two periodic orbits for c — — 1. 
What are their stabilities? Below (I 2 — c(ii -!-••• there are no periodic orbits for 
c ^ -1. 

In (33.3.6) and(33.3.7) what is the form of the higher order terms (i.e., 0 (m 7) 
where ex is some number). 

1. Draw the complete bifurcation diagram for (33.3.2) with c = —1. Is (33.3.2) 
a versal deformation for c = — 1? 



{Hint: the Melnikov theory for autonomous systems is developed in Exercise 11 and the 
Melnikov theory for heteroclinic orbits is developed in Exercise 16 following Chapter 
28.) 
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4. Show that all the cubic terms, except 2 ^, can be eliminated from (33.2.2) so that it 
takes the form of (33.2.3). {Hint: this result is due to J. Guckenheimer. 

Consider the following coordinate transformation 

s ^ r(l + gz), 
w — z + hr‘^ + iz^ , 

T = (1 + jz)~'^t. 



where g, h, i, j are unspecified constants, they will be chosen to make the equations 
simpler. In these new coordinates (33.2.2) becomes 

ds 2 2 

— — nis + asw {c bg — ah)s {d — g — ai aj)sw 
dr 

+ Rs{s, w, fj. 1 , fj. 2 ), 

du) 00 ^ ^ K X 9 

= ^2 + bs^ — + (e — 2bg + 2(a + l)h + 2bi + bj)s‘^w 

dr 

+ (/ - /ii. ^*2), 



where the remainder terms are 0(4) in s, w, pi, p 2 - We will ignore Rs and R^ and 
choose g, h, z, j so as to make the cubic terms as simple as possible. If we think of the 
cubic terms as being in a vector space spanned by 




the problem of annihilating the cubic terms reduces to solving the linear problem 

Ax — B, 

where 




( b —a 0 0 \ 

-1 0 -a a \ 

-2b 2a + 2 2b b \ ' 

0 0 0 - 1 / 

It is not hard to show that A has rank 3 and that we can eliminate all cubic terms 
except ^^3^- Thus, since our transformation did not change our equation at 0(2) 
and below, we can write our normal form in the r, 2 coordinates as 



r — pir + arz, 

I , 2 2 , j; 3 

z — P2 +br - z + fz 



(where / can take on all values). ) 

5. Once the cubic terms have been restored, study the Poincare— Andronov-Hopf bifur- 
cation in Case IIa,b and Case III. 

6. Verify that the homoclinic bifurcation shown in Figure 33.2.2 occurs for a = 2, / < 0. 

7. Suppose that W“(pi) falls into VF'^(pi) as shown in Figure 33.2.5. What are the 
conditions on the normal form in order that the hypotheses of Theorem 27.3.2 hold. 

Suppose W^{p 2 ) falls into W^{p 2 ) as shown in Figure 33.2.5. Can horseshoes also 
occur in this case? What conditions must the normal form satisfy? 
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Glossary of Frequently Used 
Terms 



Absorbing set: A positive invariant compact subset B C K”" is called an absorbing set if 
there exists a bounded subset of U, with U D B, and: 

flows: > 0 such that <p{t, U) G B, > tjj . 

maps: nu > 0 such that g^{U) C -B, Vn > nu . 

Action: The functional 

r^i 

/ L{qi, . . . ,qN .qi, ■ ■ ■ ,qN ,t)dt, 

JtQ 

defined on curves in configuration space (such that the endpoints of each curve are fixed) 
and where L is the Lagrangian, is referred to as the action. 

Action-Angle Coordinates, or Variables: These coordinates can be constructed for com- 
pletely integrable Hamiltonian systems that satisfy the hypotheses of the Liouville-Arnold 
theorem, and they are denoted by {9, /), 9 G T^, I G R^. In these coordinates the Hamil- 
tonian is a function of only the action variable, 7, i.e., H — H{I), and Hamilton’s equations 
become: 



9 = 



dH 

W 



dH 

^ ~ ~ 



These equations can be easily integrated to yield the trajectories: 



e(t) = —(io)t + 9{o), 

7(f) = 7 q = constant. 



Hence, these are the trajectories on the n dimensional tori described by the Liouville- 
Arnold theorem. is referred to as the frequency of the torus I — Iq. 

Asymptotically Autonomous Vector Field: A vector field x — f{x, f), x G R”^, is said to 
be asymptotically autonomous if limt_,.oo t) — g{x) exists. 

Asymptotic Stability: Roughly, a trajectory is said to be asymptotically stable if it is 
Liapunov stable and, at a given time, trajectories starting close enough to the specified 
trajectory approach the trajectory as time increases. 

Attracting Set: Let <^(f, x) denote the flow generated by a vector field and let x i-^ 9 {x) 
denote a map. A closed invariant set A C R^ is called an attracting set if there is some 
neighborhood H of A such that: 



flows: Vf > 0, 
maps: Vn > 0, 



and 



0(t,H) C U 
g"(u) c u 



and 



n„>o9^(U) = A. 
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Attractor: An attractor is a topologically transitive attracting set. 

Autonomous System: A dynamical system that does not depend explicitly on the indepen- 
dent variable (which is often referred to as “time”). 

Basin of Attraction: Let (p{t, x) denote the flow generated by a vector field and let x g{x) 
denote a map. The domain or basin of attraction of an attracting set A is given by 

flows: U <p{t, U), 

t<0 

maps: U 9''(U), 

n<0 

where U is any trapping region. 

Bifurcation of a Fixed Point: A fixed point {x,p) = (0,0) of a one-parameter family 
of one-dimensional vector fields, x — /(a;,/j.), or maps, x i-^ f{x,p.), is said to undergo 
a bifurcation at /j. = 0 if the flow for p near zero and x near zero is not qualitatively 
the same as the flow near ai = 0 at /r = 0. A necessary, but not sufficient, condition for 
bifurcation of a fixed point is that the fixed point is non-hyperbolic. 

Cantor Set: A compact, totally disconnected, perfect set. 

Center Manifold of a Fixed Point: The center manifold of a fixed point is an invariant 
manifold passing through a fixed point which is tangent to the center subspace at the 
fixed point, and has the same dimension as the center subspace. 

Center Subspace of a Fixed Point: Consider an (autonomous) ordinary differential equa- 
tion (resp. map) linearized about a fixed point, and the Jacobian matrix associated with 
this linearization. The center subspace associated with the fixed point is the span of the 
generalized eigenvectors corresponding to eigenvalues of the Jacobian matrix having zero 
real part (resp. modulus one). It is an invariant manifold under the linearized dynamics. 

Chaotic Invariant Set: A compact set A invariant under a flow 4>{t, x) (or map g{x)) is said 
to be chaotic if 

1. 4>{t,x) (resp. ^(a:)) has sensitive dependence on initial conditions on A. 

2. 4>{t,x) (resp. ^(a:)) is topologically transitive on A. 

Cocycle: Let P denote a parameter space. Let 0 = {9± \ t G K} denote a one-parameter 
family of mappings of P into itself, i.e., 

9t : P ^ P, 

P I-*- 6tp, 



with 

9t o 9s ^ 9t+s, Vt, s G IR, 

and 

9q ^ id. 

Then a family of mappings 

^ i G E, p G P, 

is called a cocycle on with respect to a group 0 of mappings on P if: 
1- 00, p = id, 

2 . (pt-\-s,p — 4^t,9sP ^ 4^3, p: 
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for all f , s G p G P. 

Codimension of a Submanifold: Let M be an m-dimensional manifold and let N be an 
n-dimensional submanifold contained in M; then the codimension of N is defined to be 
m — n. Equivalently, in a coordinate setting, the codimension of N is the number of 
independent equations needed to define N . Thus, the codimension of a submanifold is a 
measure of the avoidability of the submanifold as one moves about the ambient space; in 
particular, the codimension of a submanifold N is equal to the minimum dimension of a 
submanifold P (Z M that intersects N such that the intersection is transversal. This is 
the definition of codimension in a finite-dimensional setting, which permits some intuition 
to be gained; now we move to the infinite-dimensional setting. Let M be an infinite- 
dimensional manifold and let be a submanifold contained in M. Roughly speaking, 
an infinite-dimensional manifold is a set which is locally diffeomorphic to an infinite- 
dimensional Banach space. Because infinite-dimensional manifolds are discussed in this 
section only, and then mainly in a heuristic fashion, we refer the reader to the literature 
for the proper definitions.) We say that N is of codimension k if every point of N is 
contained in some open set in M which is diffeomorphic to C7 X where U is an open 
set in N. This implies that k is the smallest dimension of a submanifold P (Z M that 
intersects N such that the intersection is transversal. Thus, the definition of codimension 
in the infinite-dimensional case has the same geometrical connotations as in the finite- 
dimensional case. Now we return to our main discussion. (For the case of “codimension 
oo”, in this definition is replaced with an infinite dimensional Banach space.) 

Completely Integrable Hamiltonian System: Hamilton’s equations, for n degrees-of- 
freedom, are said to be completely integrable if there exists n functions (called integrals) 

Fi = H, F 2 , ‘ ‘ ‘ , Fn, {H is the Hamiltonian), 



which satisfy the following conditions. 



1. The Pi, f = 1, • • • , n, are functionally independent on C7, with the possible exception 
of sets of measure zero. 

2. {Pi, Fj} — 0, for all i and j, where {•, •} denotes the Poisson bracket. 



Configuration Space: The space of generalized coordinates of a system is often referred to 
as the configuration space of the system. 

Conjugacy: A change of variables, or coordinates. Consider two diffeomorphisms /: R^ 
and g:M.^ R”", and a diffeomorphism R”'. / and g are said to be 

conjugate {k < r) if there exists a diffeomorphism h\ R^ —¥ R”" such that g o h — ho f . 
If k — 0, f and g are said to be topologically conjugate. 

The conjugacy of two diffeomorphisms is often represented by the following diagram. 




The diagram is said to commute if the relation g o h — h o f holds, meaning that you can 
start at a point in the upper left-hand corner of the diagram and reach the same point in 
the lower right-hand corner of the diagram by either of the two possible routes. We note 
that h need not be defined on all of R”^ but possibly only locally about a given point. In 
such cases, / and g are said to be locally conjugate. 

Conley-Moser Conditions: A set of geometrical and analytical criteria for a map that, if 
satisfied, imply that the map behaves like a Smale horseshoe. 

C’’ Function: A function is said to be if it is r times differentiable, and if each of the 
deriviatives is a continuous function. For r = 0 the function is merely continuous. 

Diffeomorphism: A function is said to be a diffeomorphism if the function is invert- 
ible, and if both the function and its inverse are functions. For r = 0 the function is 
said to be a homeomorphism. 

Cyclic Coordinate: A synonym for ignorable coordinate. 

Degrees of Freedom: The phrase degrees of freedom has its origins in mechanics. The num- 
ber of degrees of freedom of a mechanical system is the number of independent coordinates 
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that are required to describe the configuration of the system (i. e., “position coordinates” ). 
Thus, the number of degrees of freedom is half the dimension of the phase space of the 
corresponding Hamiltonian dynamical system. If an n degree of freedom system also de- 
pends explicitly on time (i.e., the vector field is nonautonomous), the the phrase n ^ 
degree of freedom system is often used. 

Diophantine Frequency: A frequency vector lo G is said to be diophantine if there 
exists constants r, 7 > 0 such that the following infinite number of inequalities hold: 

|oj - fc| > 7|fcr^ Vfc G - {0} 



where 

\k\ = sup \ki\. 



Distance Between a Point and a Set: Let S G be an arbitrary set and p G be 
an arbitrary point. Then the distance between the point p and the set S is denoted and 
defined as: 

d{p,S)— inf |p — £c|. (34.0.1) 

x^S 

Double-Hopf Bifurcation: The bifurcation associated with a nonhyperbolic equilibrium 
point of an autonomous vector field where the matrix associated with the linearization of 
the the vector field about the equilibrium point has the following Jordan canonical form: 



( 0 cJi 0 0 \ 

oji 0 0 0 I 

0 0 0 —UJ2 I 

0 0 C02 0 / 



Duffing Equation: The second order, ordinary differential equation: 

X — X — 0, a: G K., 



or, written as a system 



X = y, 

y = X — x^ , {x,y)^R'^. 



Sometimes damping and external driving (time-dependence) are added 

X ^ y, 

y = X - - Sy + -ff{t), (x,y)eS.'^, 

in this case it is referred to as the damped (<5p), driven ( 7 /(f)) Duffing equation, where (5 
and 7 are parameters. 

Dynamical System: A system that changes in time; usually described by differential equa- 
tions (continuous time) or difference equations (sometimes called “maps) (discrete time), 
or, possibly, some combination of the two. 

Elliptic Equilibrium Point: An equilibrium point of an autonomous Hamiltonian system is 
said to be elliptic if all of the eigenvalues of the matrix associated with the linearization 
about the equilibrium point are purely imaginary, and nonzero. 

Equilibrium Solution: A constant solution of a vector field, i.e. a point in phase space 
where the vector field is zero, also referred to as a fixed point. 

Exponential Dichotomy: The linear field x = A{t)x, x G K”" is said to possess an exponen- 
tial dichotomy if each of the n linear independent solutions exhibit exponential growth 
or decay as t 00 . More precisely, suppose X{t) is the fundamental solution matrix of 
this linear vector field, i.e., for any initial condition — X{t)^o is the solution of 

the vector field passing through at t — 0, A(0) = id. Let || • || denote a norm on R”". 
Then the linear vector field is said to possess an exponential dichotomy if there exists a 
projection operator P, — P, and constants Pi, K 2 , Ai, A 2 > 0, such that 
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II X{t)PX ^ (t) II < K\ exp ( — Ai(i — r)), t > t, 
II X{t) (id — P) X~^{t) II < K2 exp (X2{t — r)) , t < r. 



Extended Phase Space: The cartesian product of the phase space with the independent 
variable (which is often referred to as “time”). 

Fold Bifurcation: Another term for a saddle-node bifurcation. 

F-Equivariant Map: Let f \ V —¥ V denote a mapping of a vector space V into V . Let F 
denote a compact Lie group with a specified action on V. We say that / is T -equivariant 
with respect to this action if 

/{■yx) = yf{x), \/y e r, X e V. 

F-Invariant Function: Now let f : V M. denote a mapping of a vector space V into R. Let 
F denote a compact Lie group with a specified action on V. We say that / is T -invariant 
with respect to this action if 



fii^) = Vy e r, X e V. 



If / is a vector field then we will refer to it as a F- equivariant vector field. It follows that if 
x(t) is a solution of a F-equivariant vector field x — f{x), then 'yx{t) is also a solution, 
for each 7 G F. 

General Linear Group, GL(R”'): GL(MP) is the group of linear, invertible transformations 
of into R”", which we can view as the group of nonsingular n X n matrices over R. 

Generalized Coordinates: A set of coordinates that contain the minimum number of in- 
dependent coordinates needed to specify all the positions of a set of particles is referred 
to as generalized coordinates, and denoted by {qi, . . . , qN)- Generalized coordinates may 
be distances, angles, or quantities relating them. The integer N is referred to as the num- 
ber of degrees- of- freedom of a system. So a system is said to have N degrees-of-freedom 
if the positions of all its components can be described by N independent (generalized) 
coordinates. 

Generalized Momentum: pi — is referred to as the ith component of generalized 

momentum. 

Generalized Velocity: (qi,---,qN) are referred to as generalized velocities, where 

{qi, . . . , g^) are generalized coordinates. 

Generating Function: Consider the Hamiltonian 






and the associated Hamilton’s equations: 



. 

q = ^(9,pU), 

op 

P = -^(^.PU)- 
dq 



Suppose we have a transformation of coordinates of the form 



Q = Q(q,p, t), 

P = P(q,p,t), 



which we assume can be inverted (viewing t as fixed) to yield 

q = q(Q, P, t), 

p = p{Q,P,t). 



The Hamiltonian in the original {q,p) coordinates can be written as a function of the (Q, P) 
coordinates: 
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H{q,p,t) = H(Q,P,t). 



In the Q — P coordinates Hamilton’s equations will hold, i.e., 



Q = ^iQ,P,t), 



provided the coordinate transformation satisfies: 



Pi = 



dF 



Pi = 



dF 



H ^ H + 



dF 

dt 



where the function F satisfies: 

AT N 

dF(q, Q,t) = ^pidqi -^PidQi + {H - H)dt. 

i=l i=l 



The function F is referred to as a generating function. 

Generic Property: A property of a map (resp. vector field) is said to be generic if the 
set of maps (resp. vector fields) possessing that property contains a residual subset in the 
topology. 

Gradient Vector Field: A gradient vector field is a vector field having the form x — 
W(a:), X G where V{x) is a scalar valued function. 

Group: A group is a set, G, equipped with a binary operation on the group elements, denoted 
and referred to as group multiplication, which satisfies the following three properties. 

1 . G is closed under group multiplication, i.e., pi , p2 ^ G ^ G. 

2 . There exists a multiplicative identity element in G, i.e., there exists an element 
e G G such that e^g — g^g — g, for any g ^ G. 

3 . For every element of G there exists a multiplicative inverse, i.e., for every p G G, 
there exists an element, denoted p~^, such that p * g~^ — g~^ * g — e. 

4 . Multiplication is associative, i.e., (pi*p2)*p3 — pi*(p2*p3)5 for any pi, p2, p3 G G. 
Hamilton’s Equations: The ordinary differential equations: 



dH 




Pi ^ f = 1, . . . , AT, 

dqi 



are referred to as Hamilton’s equations, where H — H{q,p) is a scalar valued function 
called the Hamiltonian. 

Hamilton-Jacobi Equation: The partial differential equation 



d^ 

dt 



+ H 




= 0, 



for some scalar valued function H {q G R^) is referred to as the Hamilton-Jacobi equation. 
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Hausdorff Metric: Let A and B be two nonempty, compact subsets of The Hausdorff 
metric defined on the set of nonempty, compact subsets of is defined as: 

= max{H*(A,B),H*{B,A)) . 



Hausdorff Separation: Let A and B be two nonempty, compact subsets of R^. The Haus- 
dorff separation of A and B, denoted H*{A, S), is defined as: 

H* (A. B) = max dist (a, B) — max min || a — b || . 

cx ^ A 3, ^ A b ^ B 



Heteroclinic Orbit: An orbit is said to be heteroclinic to the two invariant sets Ai and 
A2 if it approaches Ai asymptotically under time evolution as time goes to — oo and 
approaches A2 asymptotically under time evolution as time goes to -|-oo. If the invariant 
sets have stable and unstable manifolds then a heteroclinic orbit lies in the intersection 
of the unstable manifold of Ai and the stable manifold of A2. 

Homoclinic Orbit: An orbit is said to be homoclinic to an invariant set if it approaches 
the invariant set asymptotically under time evolution as time goes to ±00. If the invariant 
set has stable and unstable manifolds then a homoclinic orbit lies in the intersection of 
the stable and unstable manifolds. 

Hopf Bifurcation: See Poincare- Andronov-Hopf bifurcation. 

Hopf-Steady State Bifurcation: The bifurcation associated with a nonhyperbolic equilib- 
rium point of an autonomous vector field where the matrix associated with the lineariza- 
tion of the the vector field about the equilibrium point has the following Jordan canonical 
form: 

/ 0 -u; 0\ 

[ LU 0 0 . 

\ 0 0 0 / 

Hyperbolic Fixed Point: A fixed point of a vector field that has the property that none of 
the eigenvalues of the matrix associated with the linearization of the vector field at the 
fixed point lie on the imaginary axis. A fixed point of a map or difference equation that has 
the property that none of the eigenvalues of the matrix associated with the linearization 
of the map at the fixed point lie on the unit circle. 

Hyperbolic Trajectory: The trajectory x{t) of the vector field x — f{x,t) is said to be 
hyperbolic if the associated linear equation, ^ = Dx f {x{t) , t)^, possesses and exponential 
dichotomy. 

Ignorable Coordinate: A generalized coordinate that does not appear in the Lagrangian 
(although the generalized velocity corresponding to the coordinate may appear in the 
Lagrangian). 

Index of a Closed Curve: Consider an autonomous vector field defined in some simply 
connected region, 7 ^, of the plane (this is a two-dimensional idea only). Let T be any 
closed loop in TZ which contains no fixed points of the vector field. You can imagine at 
each point, p, on the loop F that there is an arrow representing the value of the vector 
field at p. 

Now as you move around F in the counter-clockwise sense (call this the positive direction), 
the vectors on F rotate, and when you get back to the point at which you started, they 
will have rotated through an angle 27 t/c, where k is some integer. This integer, fc, is called 
the index of F. 

The index of a closed curve containing no fixed points can be calculated by integrating the 
change in the angle of the vectors at each point on F around F (this angle is measured 
with respect to some chosen coordinate system). For a vector field defined on some simply 
connected region, TZ, of the plane given by 

X - f(x,v), 

y = g{x,y), (x,y) eK 



the index of F, k, is found by computing 
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1 

27T 

1 

2-7T 





y) \ 
f{x,y)) 



r f dg - gdf 
r P+9^ 



Infinitesimally Symplectic Transformations: Consider a r > 1, map / : 
Then / is said to be an infinitesimally symplectic or Q skew transformation if 

Q,{Df(x)u, v) — —Q,{u, Df{x)v), Va:, u,v ^ R^*^, 



where Q denotes a symplectic form on R^”". In terms of the canonical symplectic form this 
relation takes the form: 

JDf(x) + Df(x)'^J = 0. 

Infinitesimally symplectic maps arise in the study of the linearization of Hamiltonian 
vector fields at equilibria. 

Infinitesimally Reversible Linear Map: Suppose G : R”^ R”" is a linear involution and 

A : R”" R^ is a linear map. Then A is said to be infinitesimally reversible if 

AG + GA^O. 



This linearization condition is defined at a symmetric fixed point. Infinitesimally reversible 
linear maps arise in the study of the linearization of reversible vector fields at symmetric 
fixed points. 

Integral Curve: A trajectory in extended phase space. 

Invariant Manifold: An invariant set, which is also a manifold. 

Invariant Set: A set of points in phase space that has the property that trajectories with 
initial conditions in the set remain in the set forever. 

Involution: Consider a G^ map 

G : R"" 1-^ R"" , 



satisfying 

GoG^id, 

where id stands for the identity map. G is called an involution 

Kolmogorov- Arnold-Moser (KAM) Theorem: This is a perturbation theorem that ap- 
plies to perturbations of completely integrable Hamiltonian systems expressed in action- 
angle coordinates. It states that for sufficiently small perturbations the tori in the un- 
perturbed, completely integrable system corresponding to diophantine frequency vector 
persist for the perturbed system. 

Lagrangian: The scalar valued function L = T — V is referred to as the Lagrangian of a 
systems, where T is the kinetic energy and V is the potential energy. 

Lagrange’s Equations: ^ = 0, i = 1, . . . , N, are referred to as Lagrange’s equa- 

tions for an N degree-of-freedom system, where gi, . . . are generalized coordinates, 
and L is the Lagrangian. 

Liapunov Function: A scalar valued function, defined on the phase space (or, possibly just 
some subset thereof) whose level sets define trapping regions for the trajectories of a 
dynamical system. 

Liapunov Stability: Roughly, a trajectory is said to be Liapunov stable if, at a given time, 
trajectories starting close enough to the specified trajectory remain close for all later time. 

Lie Group: A Lie group is a closed subgroup of GL{MA), which we will denote by F. 

A subgroup is a subset of a group, which obeys the same axioms as a group (but with the 
important point that it is the subset that is closed with respect to group multiplication). 
The term “closed” in the definition of a Lie group needs clarification. We can identify the 
space of all n X n matrices with . Then GL(R”') is an open subset of R^ . F is said to 
be a closed subgroup if it is a closed subset of GL(R^), as well as a subgroup of GL(R"^). 
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If this closed subset is compact or connected then the associated Lie group is also said to 
be compact or connected. 

Lie Group Action on a Vector Space: Let F denote a Lie group and V a vector space. We 
say that F acts linearly on V if there is a continuous mapping, referred to as the action: 

F X V ^ V, 

(7, 1?) 1-^ 7 • u, 



such that 

1 . For each 7 G F the mapping 



P-, -.V ^v, 

u 1-^ 7 • 1? = p-y{y). 

is linear. 

2 . (a) For any 71, 72 € F, 

71 - (72 • v) = (71 * 72) • V. 

(b) e ■ V = V. 

The Lift of a Circle Map: Consider the following map of the real line M to S^: 

n : II 



iTcix 



= 6 >. 



The map F : R — \ R is said to be a lift of / : if 

n o F — / o n. 



Linearization: The procedure of Taylor expanding a dynamical system in the dependent 
variable about a specific solution, or invariant set, and throwing away all but the terms 
linear in the dependent variable. 

Liouville- Arnold Theorem: This theorem describes the phase space structure of certain 
n degree-of-freedom completely integrable Hamiltonian systems. Suppose Fi, . . . , F^ are 
n integrals satisfying the definition of complete integrability. Then we examine the (in 
general) n dimensional sets defined by 

Fi = /i, 



Fn = fn, 



where the fi, i =■ 1 , . . . , n are constants. The Liouville- Arnold theorem states that if these 
sets are compact and connected, then they are actually n-tori. 

Multiplicity of a Resonance: A resonant frequency vector to G R”" is said to be of multi- 
plicity m < n if there exist independent fc* G — {0}, f = 1, . . . , m, such that k'^ ■ cu — 0 . 

Naimark-Sacker Torus Bifurcation: A bifurcation of a fixed point of a map where the 
linearization has a pair of eigenvalues on the unit circle. The eigenvalues satisfy a non- 
resonance condition in the sense that they are no one of the first four roots of unity, i.e., 
if A denotes one of the eigenvalues then A^ 1 , n = 1 , 2 , 3 , 4 . The bifurcation of the 
fixed point gives rise to an invariant circle. This bifurcation is sometimes referred to as 
the “Hopf bifurcation for maps”. 

Negative Invariant Manifold: A negative invariant set that is also a manifold. 

Negative Invariant Set: A set of points in phase space that has the property that trajec- 
tories with initial conditions in the set remain in the set for all negative time. 
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Nekhoroshev’s Theorem: This is a perturbation theorem that applies to perturbations of 
completely integrable Hamiltonian systems expressed in action-angle coordinates. It states 
that for sufficiently small perturbations the action variable change by a small amount over 
an exponentially long time interval. 

Node: An equilibrium point of a vector field having the property that the matrix associated 
with the linearization at the fixed point has all eigenvalues either in the right or the left 
half plane. Hence, a node is a hyperbolic fixed point that is either attracting or repelling. 

Nonautonomous System: A dynamical system that depends explicitly on the independent 
variable (which is often referred to as “time”). 

Nonwandering Points: Let 0(f, x) denote the flow generated by a vector field and let 
X I— g{x) denote a map. A point xq is called nonwandering if the following holds. 

Flows: For any neighborhood U of xq and T > 0, there exists some |t| > T such that 

4>{t,u)r\u 



Maps: For any neighborhood U oi xq, there exists some n 7 ^ 0 such that 

g'*((7) nU^a. 

Note that if the map is noninvertible, then we must take n > 0. 

Nonwandering Set: The set of all nonwandering points of a map or flow is called the 
nonwandering set of that particular map or flow. 

CO and a Limit Points of Trajectories: Let 4>{t, x) denote the flow generated by a vector 
field. A point xq G is called an w limit point oi x ^ IR”', denoted w{x), if there exists 
a sequence {ti}, ti — > 00 , such that 



4>{ti , x) 



Xq. 



O' limit points are defined similarly by taking a sequence {fi}, ti — > — 00 . 

CO and o Limit Sets of a Flow: The set of all w limit points of a flow or map is called the 
Lj limit set. The a limit set is similarly defined. 

Normal Form: A local theory that applies in the neighborhood of an orbit of a vector 
field or map. The theory provides an algorithmic way to generate a sequence of nonlinear 
coordinate changes that eliminate as much of the nonlinearity as possible at each order 
(where “order” refers to the terms in a Taylor expansion about the orbit). Interestingly, 
the form of the nonlinearity that cannot be eliminated by such coordinate changes is 
determined by the structure of the linear part of the vector field or map. 

Orbit: The set of points in phase space through which a trajectory passes. 

Orbital Derivative: Consider a dynamical system, with its phase space, and a function 
defined on this phase space. The orbital derivative of the function is the derivative of the 
function along trajectories of this dynamical system. 

Order of a Resonance: Suppose co G is a frequency vector and k ■ co — 0 for some 
k G 2”" — {0}. Then the order of this resonance is defined to be |fc| = 

Perfect Set: A set is called perfect if it is closed and every point in the set is a limit point 
of the set. 

Period-Doubling Bifurcation: A bifurcation of a fixed point of a map where the lineariza- 
tion at the fixed point has an eigenvalue of —1. In this bifurcation the fixed point (i.e., 
period one point) changes its stability type and a period two orbit is created (or destroyed). 

Phase Curve: The solution of an ordinary differential equation, synonomous with the term 
trajectory. 

Phase Space: The space of dependent variables of an ordinary differential equation or map. 

Poincare-Andronov-Hopf Bifurcation: A bifurcation of a fixed point of a vector field 
where the linearization has a pair of purely imaginary eigenvalues (which are not zero). 
Provided a nondegeneracy condition involving the quadratic and cubic nonlinear terms 
holds, this bifurcation gives rise to a unique periodic orbit. 
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Poincare-Bendixson Theorem: A theorem that applies to two- dimensional, autonomous 
vector fields on the sphere, or on a compact, positively invariant subset of It states 
that the omega limit sets are either equilibrium points, periodic orbits, or a collection of 
orbits made up of fixed points and orbits connecting fixed points (heteroclinic orbits) that 
form a closed path. 

Poincare Map: A map obtained from the trajectories of an ordinary differential equation. 
There are many ways in which this can be done. However, a guiding principle is that the 
dynamics of the map must correspond to the dynamics of the vector field. 

Poisson Bracket: Let H, G : U M., U C. denote two C’', r > 2, functions. Then the 

Poisson bracket of these two functions is another function, and it is defined through the 
symplectic form on as follows: 

{H,G} = Q{Xh,Xg) = {Xh,JXg), 



where Xh and Xg denote the Hamiltonian vector fields corresponding to the Hamiltonian 
functions H and G, respectively. 

Positive Invariant Manifold: A positive invariant set that is also a manifold. 

Positive Invariant Set: A set of points in phase space that has the property that trajectories 
with initial conditions in the set remain in the set for all positive time. 

Pullback Absorbing Family: A family 

B — {Bp, p ^ P} , 



of compact subsets of R”" is called a pullback absorbing family for a cocycle 
{4>t,p-> t ^ R, P ^ -P} on R^ if for each p G P there is a bounded subset of R”^ , Up, satisfying 
Up D Bp, and a time tu^ > 0 such that: 

4>t,e_i.p{Ue_^p) C Bp, for all t > tUp. 

Pullback Attracting Set: A family A — {Ap, p G P} of compact subsets of R^ is called 
a pullback attracting set of a cocycle {0t,p, t G R, p G P} on R”" if it is invariant in the 
sense that: 

4^t,p (^p) — -^6-tpj t G R, p G P, 

and pullback attracting in the sense that for every Ap there exists a bounded subset of 
R^, Dp D Ap, such that 

{4>t,0_,p(De_^p), =0. 



Principle of Least Action: The principle of least action states that extrema of the action 
functional are solutions of Lagrange’s equations, and vice-versa. 

Quasiperiodic Motion: A function 



h : R^ ^ R^ 

t 1-^ h{t) 



is called quasiperiodic if it can be represented in the form 

h{t) = H{uJlt, . . . , UJ-nt), 

where H{xi,...,Xn) is a function of period 27 t in xi,...,Xn- The real numbers 
cji, . . . ,iOn are called the basic frequencies. We shall denote by G^{uJi, . . . ,LOn) the 
class of h{t) for which H{xi, . . . , Xn) is r times continuously differentiable. 

For example, the function 

h{t) — 7 i cos -1- 72 sin uj2t 



is a quasiperiodic function. 
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{Note: There exists a more general class of functions called almost periodic functions 
which can be viewed as quasiperiodic functions having an infinite number of basic fre- 
quencies. These will not be considered in this book, see Hale [1980] for a discussion and 
rigorous definitions.) 

A quasiperiodic solution 0(t), of an ordinary differential equation is a solution which is 
quasiperiodic in time. A quasiperiodic orbit is the orbit of any point through which (p{t) 
passes. A quasiperiodic orbit may be interpreted geometrically as lying on an n dimen- 
sional torus. This can be seen as follows. Consider the equation 

y — H{xi, . . . , Xrt). 



Then, if m > n and DxH has rank n for all x — (xi, . . . ,Xn)^ then this equation can be 
viewed as an embedding of an n-torus in m space with a:i, . . . , x^ serving as coordinates 
on the torus. Now, viewing h{t) as a solution of an ordinary differential equation, since 
Xi — iOit^ 2 = 1, . . . , n, h{t) can be viewed as tracing a curve on the n-torus as t varies. 

Representation of T on V : Let GL{V') denote the group of invertible linear transformations 
of V into V . The map 



p : r ^ GL{V), 
7 '-t P-y, 



is called a representation of F on V. 

Resonance: The frequency vector u) G is said to be resonant if there exists k G — {0} 
such that • cj = 0. If no such k G 2”" — {0} exists, lj is said to be nonresonant. 

Residual Set: Let X be a topological space, and let U he a subset of X . U is called a residual 
set if it is the intersection of a countable number of sets each of which are open and dense 
in X. If every residual set in X is itself dense in X, then X is called a Baire space. 

Reversible Dynamical System: Let x — f{x), x G be a vector field and let G : 
—¥ bbR^ be an involution. Then the vector field is said to be reversible if ^ (G{x)) = 
— / (G(a:)). A map Xn+i — 9{^n) is said to be reversible if g (G(a:n-i-i)) = G{xn)- 

Reversible Linear Map: Suppose G : —t R”' is a linear involution and A : R”' R”" is 

a linear map. Then A is said to be reversible if 

AGA - G. 



Note that this linearization condition is defined at a symmetric fixed point. Reversible 
linear maps arise in the study of the linearization of reversible maps at symmetric fixed 
points. 

Rotation Number: Consider an orientation preserving homeomorphism / : , with 

F a lift of /, we define the quantity 



po(F) = Jim 



n 



There are two main operational definitions of rotation number. For / : — >• an 

orientation preserving homeomorphism, with F a lift of /: 

1. Some authors define the rotation number of /, denoted p(/), as the fractional part 
of po(-^) (e-g- Devaney [1986]). 

2. Other authors define the rotation number of / to be po(^) (®-g-i Katok and Has- 
selblatt [1995]). 

In either case, the rotation number exists, and is independent of the point x. 

Saddle-Node Bifurcation: A bifurcation of a fixed point where one goes from zero to two 
fixed points as a parameter is varied through the bifurcation point. At the bifurcation 
point the linearization has a zero eigenvalue for vector fields and an eigenvalue of one for 
maps. This bifurcation is also referred to as a tangent or fold bifurcation. 

Sensitive dependence on initial conditions: Consider C’’ (r > 1) autonomous vector 
fields and maps on R^ denoted as follows 
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vector field 
map 



X = f{x), 
X !->■ g{x). 



(34.0.2) 

(34.0.3) 



Denote the flow generated by (34.0.2) by (p{t, x) and we assume that it exists for all 
f > 0. We assume that A C is a compact set invariant under (resp. g{x))^ i.e., 

(/)(f,A) C A for all f G K (resp. ^”^(A) C A for all n ^ Z, except that if g is not invertible, 
we must take n > 0). The flow <^(f, x) (resp. g{x)) is said to have sensitive dependence on 
initial conditions on A if there exists e > 0 such that, for any x ^ A and any neighborhood 
U of X, there exists y G U and t > 0 (resp. n > 0) such that \4>{t, x) — 4>{t, y)\ > e (resp. 
\a"{x)-g’'{y)\>e). 

Shift Map: A map defined on the space of bi-infinite sequences of N symbols (see symbol 
sequence). It acts by shifting the sequence one place to the left of the point separating 
the sequence into two infinite halves. More precisely, if we denote the shift map by cr, and 
a symbol sequence by 



: — - S_n ■ ■ ■ S_i.SoSl • • • • • •} 

where si G {1, 2, . . . , A”} Vz. 



Then 



"({• 



S-n ■ ■ ■ S-\Sq.S-]_ ■ 



•}) 



Skew-Product Flow: This is a formalism that can be applied to nonautonomous vector 
fields which allows one to retain the flow property. The crucial observation leading to the 
development of the approach is the following. Suppose a^(f) is a solution oi x — f{x,t). 
Then Xt (f ) = x{t -|- r) is a solution of x-r (f ) = f-r {xt (t),t) = f{x{t-\-T),t-\- r) 

We first define the space of nonautonomous vector fields whose time translates remain within 
the space, i.e.. 



T = {Space of functions / : X R ^ R”" such that 

/t-(-, -) = /(-, - + r) ^ Tiox all r G R.} 



We then define the group of shift operators on J-\ 

9^ ■. T ^ T, 

f^9^f = f^, VrGR, 



and we define the product space: 

A = R^ X :r. 

Let x{t, xq, f) denote a solution of i = f{x,t) with x{0, xq, f) — xq. Finally, we define 
the family of mappings: 

4^* : A ^ A, 

(xo,f) !->■ {x(t,xo, f),6tf) . 

It can then be shown that is a one-parameter family of mappings of A into A, or flow: 
^t + a(xo, f) = 'S’t O 'i'aixo, f)- 



Smale Horseshoe: As originally conceived by Smale, a two-dimensional, nonlinear, invertible 
map possessing an invariant set with a remarkable structure. The invariant set has the 
structure of a Cantor set. In addition, it has a countable infinity of periodic orbits of all 
possible periods, an uncountable infinity of nonperiodic orbits, and an orbit that is dense 
in the invariant set. The dynamics on the invariant set is chaotic. The map has since been 
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generalized to many other settings (n dimensions as well as infinite dimensions, and in 
also in the context of noninvertible maps). 

Space of Vector- Valued Homogeneous Polynomials of Degree fc, : Let { si , • • • , } 

denote a basis of and let y — (yi, ■ ■ ■ ,yn) be coordinates with respect to this basis. 
Now consider those basis elements with coefficients consisting of homogeneous polynomials 
of degree fc, i.e., 

VT'^ - - - )si , ^ k, 

3 = 1 

where rrij > 0 are integers. We refer to these objects as vector-valued homogeneous 
polynomials of degree k. The set of all vector-valued homogeneous polynomials of degree 
k forms a linear vector space, which we denote by Hk- A basis for Hk consists of elements 
formed by considering all possible homogeneous polynomials of degree k that multiply 
each Si. 

Stable Manifold of a Fixed Point: The stable manifold of a fixed point is an invariant 
manifold passing through a fixed point which is tangent to the stable subspace at the 
fixed point, and has the same dimension as the stable subspace. Trajectories starting in the 
stable manifold have qualitatively the same dynamics as trajectories in the stable subspace 
under the linearized dynamics. Namely, they approach the fixed point asymptotically as 
f ^ oo at an exponential rate. 

Stable Subspace of a Fixed Point: Consider an (autonomous) ordinary differential equa- 
tion (resp. map) linearized about a fixed point, and the Jacobian matrix associated with 
this linearization. The stable subspace associated with the fixed point is the span of 
the generalized eigenvectors corresponding to eigenvalues of the Jacobian matrix having 
negative real part (resp. modulus less than one). It is an invariant manifold under the 
linearized dynamics. Trajectories starting in the stable subspace approach the fixed point 
at an exponential rate as t oo. 

Strange Attractor: Suppose A. C is an attractor. Then A is called a strange attractor 
if it is also a chaotic invariant set. 

Structural Stability: Let (R”" , ) denote the space of maps of R”" into R^ . In terms of 

dynamical systems, we can think of the elements of C‘^(R”', R”') as being vector fields. We 
denote the subset of C^(R^,R^) consisting of the diffeomorphisms by Diff^(R”', R”"). 
Two elements of C^(R^,R^) are said to be e-close {k < r), or just close, if they, 
along with their first k derivatives, are within e as measured in some norm. Consider 
a map / G Diff^(M, M) (resp. a vector field in C^(M, M)); then / is said to be 
structurally stable if there exists a neighborhood A/" of / in the topology such that / 
is C® conjugate (resp. equivalent) to every map (resp. vector field) in A/". 

Symbol Sequence: The space of bi-infinite symbol sequences, denoted is the set of 

sequences of the form: 



s 






•} 



where Si G {1, 2, . . . , N} Vi. 



The term “bi-infinite” comes from the fact that two infinite sequences are concatenated, 
with the two “halves” separated by a period. 

Symbolic Dynamics: An approach for “modeling” the dynamics of a map or flow (or even 
a more general type of dynamical system) by the shift map acting on the space of symbol 
sequences. The symbols can often be thought of for labels of different regions in the phase 
space of the dynamical system and the shift dynamics describes the visitation of points 
to the different regions under the dynamics. 

Symmetric Fixed Point: In the study of reversible system we say that a point, say xq., is 
a symmetric fixed point if it is a fixed point of the dynamical system and if G{xq) — xq, 
where G is the involution used to define reversibility. 

Symplectic Form: By a symplectic form on R^^ we mean a skew-symmetric, nondegenerate 
bilinear form. By nondegenerate we mean that the matrix representation of the bilinear 
form is nonsingular. A vector space equipped with a symplectic form is called a symplectic 
vector space. For our phase space R^”^ a symplectic form is given by 



Q{u, v) = {u, Jv) 
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where (•, •) denotes the standard Euclidean inner product on and 



J = 



0 id 
-id 0 



where “id” denotes the n X n identity matrix. This particular symplectic form is referred 
to as the canonical symplectic form. 

Topological Transitivity: Let 4>{t,x) denote the flow generated by a vector field and let 
X I— >• g{x) denote a map. A closed invariant set A is said to be topologically transitive if, 
for any two open sets U , V (Z A 

flows: 3 t M U) n ^ 0, 

maps: 3 n G Z 3 g'^iU) Pi V" / 0. 

Symplectic or Canonical Transformations: Consider a C”', r > 1, diffeomorphism / : 
Then / is said to be a canonical or symplectic transformation if 

n(u, u) — Q(Df{x)u, Df{x)v), Vx, u,v ^ 

where Q is a symplectic form on Equivalently, a canonical transformation is one that 

is defined by a generating function. 

Takens-Bogdanov Bifurcation: The bifurcation associated with a nonhyperbolic equilib- 
rium point of an autonomous vector field where the matrix associated with the lineariza- 
tion of the the vector field about the equilibrium point has the following Jordan canonical 
form: 




Tangent Bifurcation: Another term for a saddle-node bifurcation. 

Transcritical Bifurcation: A bifurcation of a fixed point where one goes from two fixed 
points, to one at the bifurcation point, and back to two as one varies the parameter 
through the bifurcation point. In this scenario one observes the fixed points colliding 
and then moving apart, but interchanging stability types. At the bifurcation point the 
linearization has a zero eigenvalue for vector fields and an eigenvalue of one for maps. 

Transversality: Let M and N be differentiable (at least C^) manifolds in R”^. Let p be a 
point in R^; then M and N are said to be transversal af p if p 0 M C\ N\ or, if p G M C\ N , 
then TpM 3- TpN — R”', where TpM and TpN denote the tangent spaces of M and N, 
respectively, at the point p. M and N are said to be transversal if they are transversal at 
every point p G R^ . 

Trapping Region: The open set U in the definition of attracting set is referred to as a 
trapping region. That is, points starting in U remain in U for all later times. 

Trajectory: The solution of an ordinary differential equation, synonomous with the phrase 
phase curve. 

Unstable Manifold of a Fixed Point: The unstable manifold of a fixed point is an in- 
variant manifold passing through a fixed point which is tangent to the unstable subspace 
at the fixed point, and has the same dimension as the unstable subspace. Trajectories 
starting in the unstable manifold have qualitatively the same dynamics as trajectories in 
the unstable subspace under the linearized dynamics. Namely, they approach the fixed 
point asymptotically as t — oo at an exponential rate. 

Unstable Subspace of a Fixed Point: Consider an (autonomous) ordinary differential 
equation (resp. map) linearized about a fixed point, and the Jacobian matrix associated 
with this linearization. The unstable subspace associated with the fixed point is the span 
of the generalized eigenvectors corresponding to eigenvalues of the Jacobian matrix having 
positive real part (resp. modulus greater than one). It is an invariant manifold under the 
linearized dynamics. Trajectories starting in the unstable subspace approach the fixed 
point at an exponential rate as t — oo. 

Versal Deformation: Roughly speaking, a versal deformation of a degenerate “object”, such 
as a vector field, map, or matrix, is a parametrized family of such “objects” that contains 
the degenerate object such that the parameterized family itself is structurally stable. The 
minimum number of parameters needed for a versal deformation is referred to as “the 
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codimension” of the degenerate object, which may be interpreted as a bifurcation point 
in the appropriate space. 

Volume Preserving Map: The map x f{x) is said to be volume preserving if det Df{x) = 

1 . 

Volume Preserving Vector Field: The vector field x — f{x, t) is said to be volume pre- 
serving of V • f{x, t) — 0. Sometimes the phrase “divergence free vector field” is used. 
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